[HTML][HTML] Hallucinations in large multilingual translation models

NM Guerreiro, DM Alves, J Waldendorf… - Transactions of the …, 2023 - direct.mit.edu
Hallucinated translations can severely undermine and raise safety issues when machine
translation systems are deployed in the wild. Previous research on the topic focused on …

Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding

R Sennrich, J Vamvas, A Mohammadshahi - arXiv preprint arXiv …, 2023 - arxiv.org
Hallucinations and off-target translation remain unsolved problems in machine translation,
especially for low-resource languages and massively multilingual models. In this paper, we …

Codetransocean: A comprehensive multilingual benchmark for code translation

W Yan, Y Tian, Y Li, Q Chen, W Wang - arXiv preprint arXiv:2310.04951, 2023 - arxiv.org
Recent code translation techniques exploit neural machine translation models to translate
source code from one programming language to another to satisfy production compatibility …

Memory-efficient nllb-200: Language-specific expert pruning of a massively multilingual machine translation model

Y Koishekenov, A Berard, V Nikoulina - arXiv preprint arXiv:2212.09811, 2022 - arxiv.org
The recently released NLLB-200 is a set of multilingual Neural Machine Translation models
that cover 202 languages. The largest model is based on a Mixture of Experts architecture …

Extending multilingual machine translation through imitation learning

W Lai, V Hangya, A Fraser - arXiv preprint arXiv:2311.08538, 2023 - arxiv.org
Despite the growing variety of languages supported by existing multilingual neural machine
translation (MNMT) models, most of the world's languages are still being left behind. We aim …

Tuning llms with contrastive alignment instructions for machine translation in unseen, low-resource languages

Z Mao, Y Yu - arXiv preprint arXiv:2401.05811, 2024 - arxiv.org
This article introduces contrastive alignment instructions (AlignInstruct) to address two
challenges in machine translation (MT) on large language models (LLMs). One is the …

Target-agnostic gender-aware contrastive learning for mitigating bias in multilingual machine translation

M Lee, H Koh, K Lee, D Zhang, M Kim… - arXiv preprint arXiv …, 2023 - arxiv.org
Gender bias is a significant issue in machine translation, leading to ongoing research efforts
in developing bias mitigation techniques. However, most works focus on debiasing bilingual …

Multilingual Distilwhisper: Efficient Distillation of Multi-Task Speech Models Via Language-Specific Experts

TP Ferraz, MZ Boito, C Brun… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Whisper is a multitask and multilingual speech model covering 99 languages. It yields
commendable automatic speech recognition (ASR) results in a subset of its covered …

Intriguing properties of compression on multilingual models

K Ogueji, O Ahia, G Onilude, S Gehrmann… - arXiv preprint arXiv …, 2022 - arxiv.org
Multilingual models are often particularly dependent on scaling to generalize to a growing
number of languages. Compression techniques are widely relied upon to reconcile the …

Knowcomp submission for wmt23 word-level autocompletion task

Y Wu, H Shi, W Wang, Y Song - Proceedings of the Eighth …, 2023 - aclanthology.org
The NLP community has recently witnessed the success of Large Language Models (LLMs)
across various Natural Language Processing (NLP) tasks. However, the potential of LLMs …