The unreasonable effectiveness of few-shot learning for machine translation

X Garcia, Y Bansal, C Cherry, G Foster… - International …, 2023 - proceedings.mlr.press
We demonstrate the potential of few-shot translation systems, trained with unpaired
language data, for both high and low-resource language pairs. We show that with only 5 …

Error analysis prompting enables human-like translation evaluation in large language models: A case study on chatgpt

Q Lu, B Qiu, L Ding, L Xie, D Tao - 2023 - preprints.org
Generative large language models (LLMs), eg, ChatGPT, have demonstrated remarkable
proficiency across several NLP tasks such as machine translation, question answering, text …

Seamless: Multilingual Expressive and Streaming Speech Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arXiv preprint arXiv …, 2023 - arxiv.org
Large-scale automatic speech translation systems today lack key features that help machine-
mediated communication feel seamless when compared to human-to-human dialogue. In …

The impact of artificial intelligence on language translation: a review

YA Mohamed, A Khanan, M Bashir… - Ieee …, 2024 - ieeexplore.ieee.org
In the context of a more linked and globalized society, the significance of proficient cross-
cultural communication has been increasing to a position of utmost importance. Language …

Fast conformer with linearly scalable attention for efficient speech recognition

D Rekesh, NR Koluguri, S Kriman… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Conformer-based models have become the dominant end-to-end architecture for speech
processing tasks. With the objective of enhancing the conformer architecture for efficient …

Unity: Two-pass direct speech-to-speech translation with discrete units

H Inaguma, S Popuri, I Kulikov, PJ Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
Direct speech-to-speech translation (S2ST), in which all components can be optimized
jointly, is advantageous over cascaded approaches to achieve fast inference with a …

Translatotron 2: High-quality direct speech-to-speech translation with voice preservation

Y Jia, MT Ramanovich, T Remez… - … on Machine Learning, 2022 - proceedings.mlr.press
We present Translatotron 2, a neural direct speech-to-speech translation model that can be
trained end-to-end. Translatotron 2 consists of a speech encoder, a linguistic decoder, an …

Revisiting end-to-end speech-to-text translation from scratch

B Zhang, B Haddow… - … conference on machine …, 2022 - proceedings.mlr.press
Abstract End-to-end (E2E) speech-to-text translation (ST) often depends on pretraining its
encoder and/or decoder using source transcripts via speech recognition or text translation …

Findings of the iwslt 2023 evaluation campaign

M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli… - 2023 - um.edu.mt
This paper reports on the shared tasks organized by the 20th IWSLT Conference. The
shared tasks address 9 scientific challenges in spoken language translation: simultaneous …

Pre-training for speech translation: Ctc meets optimal transport

PH Le, H Gong, C Wang, J Pino… - International …, 2023 - proceedings.mlr.press
The gap between speech and text modalities is a major challenge in speech-to-text
translation (ST). Different methods have been proposed to reduce this gap, but most of them …