End-to-end speech-to-text translation: A survey

N Sethiya, CK Maurya - Computer Speech & Language, 2024 - Elsevier
Abstract Speech-to-Text (ST) translation pertains to the task of converting speech signals in
one language to text in another language. It finds its application in various domains, such as …

Learning when to translate for streaming speech

Q Dong, Y Zhu, M Wang, L Li - arXiv preprint arXiv:2109.07368, 2021 - arxiv.org
How to find proper moments to generate partial sentence translation given a streaming
speech input? Existing approaches waiting-and-translating for a fixed duration often break …

Direct simultaneous speech-to-text translation assisted by synchronized streaming ASR

J Chen, M Ma, R Zheng, L Huang - arXiv preprint arXiv:2106.06636, 2021 - arxiv.org
Simultaneous speech-to-text translation is widely useful in many scenarios. The
conventional cascaded approach uses a pipeline of streaming ASR followed by …

Segmentation-Free Streaming Machine Translation

J Iranzo-Sánchez, J Iranzo-Sánchez… - Transactions of the …, 2024 - direct.mit.edu
Abstract Streaming Machine Translation (MT) is the task of translating an unbounded input
text stream in real-time. The traditional cascade approach, which combines an Automatic …

Blockwise streaming transformer for spoken language understanding and simultaneous speech translation

K Deng, S Watanabe, J Shi, S Arora - arXiv preprint arXiv:2204.08920, 2022 - arxiv.org
Although Transformers have gained success in several speech processing tasks like spoken
language understanding (SLU) and speech translation (ST), achieving online processing …

Beyond sentence-level end-to-end speech translation: Context helps

B Zhang, I Titov, B Haddow… - Proceedings of the 59th …, 2021 - aclanthology.org
Document-level contextual information has shown benefits to text-based machine
translation, but whether and how context helps end-to-end (E2E) speech translation (ST) is …

The effectiveness of computer-assisted interpreting: A preliminary study based on English-Chinese consecutive interpreting

S Chen, JL Kruger - Translation and Interpreting Studies, 2023 - jbe-platform.com
Facing a new technological turn, the field of interpreting is in great need of evidence on the
effectiveness of computer-assisted interpreting. This study proposes a computer-assisted …

Low-latency sequence-to-sequence speech recognition and translation by partial hypothesis selection

D Liu, G Spanakis, J Niehues - arXiv preprint arXiv:2005.11185, 2020 - arxiv.org
Encoder-decoder models provide a generic architecture for sequence-to-sequence tasks
such as speech recognition and translation. While offline systems are often evaluated on …

[PDF][PDF] Artificial Intelligence Based Language Translation Platform.

M Kolhar, A Alameen - Intelligent Automation & Soft Computing, 2021 - cdn.techscience.cn
The use of computer-based technologies by non-native Arabic-speaking teachers for
teaching native Arabic-speaking students can result in higher learner engagement. In this …

Lost in interpreting: Speech translation from source or interpreter?

D Macháček, M Žilinec, O Bojar - arXiv preprint arXiv:2106.09343, 2021 - arxiv.org
Interpreters facilitate multi-lingual meetings but the affordable set of languages is often
smaller than what is needed. Automatic simultaneous speech translation can extend the set …