Sequence-to-sequence models can directly translate foreign speech

RJ Weiss, J Chorowski, N Jaitly, Y Wu… - arXiv preprint arXiv …, 2017 - arxiv.org
We present a recurrent encoder-decoder deep neural network architecture that directly
translates speech in one language into text in another. The model does not explicitly …

Direct speech-to-speech translation with a sequence-to-sequence model

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arXiv preprint arXiv …, 2019 - arxiv.org
We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Speech translation and the end-to-end promise: Taking stock of where we are

M Sperber, M Paulik - arXiv preprint arXiv:2004.06358, 2020 - arxiv.org
Over its three decade history, speech translation has experienced several shifts in its
primary research themes; moving from loosely coupled cascades of speech recognition and …

An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research.

S Nießen, FJ Och, G Leusch, H Ney - LREC, 2000 - www-i6.informatik.rwth-aachen.de
In this paper we present a tool for the evaluation of translation quality. First, the typical
requirements of such a tool in the framework of machine translation (MT) research are …

Statistical approaches to computer-assisted translation

S Barrachina, O Bender, F Casacuberta… - Computational …, 2009 - direct.mit.edu
Current machine translation (MT) systems are still not perfect. In practice, the output from
these systems needs to be edited to correct errors. A way of increasing the productivity of the …

N-gram-based Machine Translation

JB Mariño, RE Banchs, JM Crego… - Computational …, 2006 - direct.mit.edu
This article describes in detail an n-gram approach to statistical machine translation. This
approach consists of a log-linear combination of a translation model based on n-grams of …

Multimodal machine translation through visuals and speech

U Sulubacak, O Caglayan, SA Grönroos, A Rouhe… - Machine …, 2020 - Springer
Multimodal machine translation involves drawing information from more than one modality,
based on the assumption that the additional modalities will contain useful alternative views …

Speech translation: Coupling of recognition and translation

H Ney - 1999 IEEE International Conference on Acoustics …, 1999 - ieeexplore.ieee.org
In speech translation, we are faced with the problem of how to couple the speech
recognition process and the translation process. Starting from the Bayes decision rule for …

An efficient method for determining bilingual word classes

FJ Och - Ninth Conference of the European Chapter of the …, 1999 - aclanthology.org
In statistical natural language processing we always face the problem of sparse data. One
way to reduce this problem is to group words into equivalence classes which is a standard …

Statistical machine translation: From single word models to alignment templates

FJ Och - 2003 - d-nb.info
In this work, new approaches for machine translation using statistical methods are
described. In addition to the standard source-channel approach to statistical machine …