Survey of low-resource machine translation

B Haddow, R Bawden, AVM Barone, J Helcl… - Computational …, 2022 - direct.mit.edu
We present a survey covering the state of the art in low-resource machine translation (MT)
research. There are currently around 7,000 languages spoken in the world and almost all …

Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages

G Ramesh, S Doddapaneni, A Bheemaraj… - Transactions of the …, 2022 - direct.mit.edu
We present Samanantar, the largest publicly available parallel corpora collection for Indic
languages. The collection contains a total of 49.7 million sentence pairs between English …

Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding

R Sennrich, J Vamvas, A Mohammadshahi - arXiv preprint arXiv …, 2023 - arxiv.org
Hallucinations and off-target translation remain unsolved problems in machine translation,
especially for low-resource languages and massively multilingual models. In this paper, we …

A closer look at few-shot crosslingual transfer: The choice of shots matters

M Zhao, Y Zhu, E Shareghi, I Vulić, R Reichart… - arXiv preprint arXiv …, 2020 - arxiv.org
Few-shot crosslingual transfer has been shown to outperform its zero-shot counterpart with
pretrained encoders like multilingual BERT. Despite its growing popularity, little to no …

Improving zero-shot translation by disentangling positional information

D Liu, J Niehues, J Cross, F Guzmán, X Li - arXiv preprint arXiv …, 2020 - arxiv.org
Multilingual neural machine translation has shown the capability of directly translating
between language pairs unseen in training, ie zero-shot translation. Despite being …

Informative language representation learning for massively multilingual neural machine translation

R Jin, D Xiong - arXiv preprint arXiv:2209.01530, 2022 - arxiv.org
In a multilingual neural machine translation model that fully shares parameters across all
languages, an artificial language token is usually used to guide translation into the desired …

An empirical investigation of word alignment supervision for zero-shot multilingual neural machine translation

A Raganato, R Vázquez, M Creutz… - Proceedings of the …, 2021 - aclanthology.org
Zero-shot translations is a fascinating feature of Multilingual Neural Machine Translation
(MNMT) systems. These MNMT models are usually trained on English-centric data, ie …

Adapting to Non-Centered Languages for Zero-shot Multilingual Translation

Z Qu, T Watanabe - arXiv preprint arXiv:2209.04138, 2022 - arxiv.org
Multilingual neural machine translation can translate unseen language pairs during training,
ie zero-shot translation. However, the zero-shot translation is always unstable. Although …

Language tokens: A frustratingly simple approach improves zero-shot performance of multilingual translation

M ElNokrashy, A Hendy, M Maher, M Afify… - arXiv preprint arXiv …, 2022 - arxiv.org
This paper proposes a simple yet effective method to improve direct (X-to-Y) translation for
both cases: zero-shot and when direct data is available. We modify the input tokens at both …

Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

H Amer, YJ Kim, M Afify, H Matsushita… - arXiv preprint arXiv …, 2022 - arxiv.org
Multilingual Neural Machine Translation has been showing great success using transformer
models. Deploying these models is challenging because they usually require large …