SMaLL-100: Introducing shallow multilingual machine translation model for low-resource languages

NM Guerreiro, DM Alves, J Waldendorf… - Transactions of the …, 2023 - direct.mit.edu

Hallucinated translations can severely undermine and raise safety issues when machine
translation systems are deployed in the wild. Previous research on the topic focused on …

被引用次数：87 相关文章所有 11 个版本

[PDF] arxiv.org

Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding

R Sennrich, J Vamvas, A Mohammadshahi - arXiv preprint arXiv …, 2023 - arxiv.org

Hallucinations and off-target translation remain unsolved problems in machine translation,
especially for low-resource languages and massively multilingual models. In this paper, we …

被引用次数：18 相关文章所有 3 个版本

[PDF] arxiv.org

Codetransocean: A comprehensive multilingual benchmark for code translation

W Yan, Y Tian, Y Li, Q Chen, W Wang - arXiv preprint arXiv:2310.04951, 2023 - arxiv.org

Recent code translation techniques exploit neural machine translation models to translate
source code from one programming language to another to satisfy production compatibility …

被引用次数：14 相关文章所有 5 个版本

[PDF] arxiv.org

Memory-efficient nllb-200: Language-specific expert pruning of a massively multilingual machine translation model

Y Koishekenov, A Berard, V Nikoulina - arXiv preprint arXiv:2212.09811, 2022 - arxiv.org

The recently released NLLB-200 is a set of multilingual Neural Machine Translation models
that cover 202 languages. The largest model is based on a Mixture of Experts architecture …

被引用次数：14 相关文章所有 6 个版本

[PDF] arxiv.org

Extending multilingual machine translation through imitation learning

W Lai, V Hangya, A Fraser - arXiv preprint arXiv:2311.08538, 2023 - arxiv.org

Despite the growing variety of languages supported by existing multilingual neural machine
translation (MNMT) models, most of the world's languages are still being left behind. We aim …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Tuning llms with contrastive alignment instructions for machine translation in unseen, low-resource languages

Z Mao, Y Yu - arXiv preprint arXiv:2401.05811, 2024 - arxiv.org

This article introduces contrastive alignment instructions (AlignInstruct) to address two
challenges in machine translation (MT) on large language models (LLMs). One is the …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Target-agnostic gender-aware contrastive learning for mitigating bias in multilingual machine translation

M Lee, H Koh, K Lee, D Zhang, M Kim… - arXiv preprint arXiv …, 2023 - arxiv.org

Gender bias is a significant issue in machine translation, leading to ongoing research efforts
in developing bias mitigation techniques. However, most works focus on debiasing bilingual …

被引用次数：4 相关文章所有 4 个版本

[PDF] hal.science

Multilingual Distilwhisper: Efficient Distillation of Multi-Task Speech Models Via Language-Specific Experts

TP Ferraz, MZ Boito, C Brun… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Whisper is a multitask and multilingual speech model covering 99 languages. It yields
commendable automatic speech recognition (ASR) results in a subset of its covered …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Intriguing properties of compression on multilingual models

K Ogueji, O Ahia, G Onilude, S Gehrmann… - arXiv preprint arXiv …, 2022 - arxiv.org

Multilingual models are often particularly dependent on scaling to generalize to a growing
number of languages. Compression techniques are widely relied upon to reconcile the …

被引用次数：6 相关文章所有 3 个版本

[PDF] aclanthology.org

Knowcomp submission for wmt23 word-level autocompletion task

Y Wu, H Shi, W Wang, Y Song - Proceedings of the Eighth …, 2023 - aclanthology.org

The NLP community has recently witnessed the success of Large Language Models (LLMs)
across various Natural Language Processing (NLP) tasks. However, the potential of LLMs …

被引用次数：1 相关文章所有 3 个版本

高级搜索

QQ 群