Findings of the IWSLT 2022 Evaluation Campaign.

X Garcia, Y Bansal, C Cherry, G Foster… - International …, 2023 - proceedings.mlr.press

We demonstrate the potential of few-shot translation systems, trained with unpaired
language data, for both high and low-resource language pairs. We show that with only 5 …

被引用次数：58 相关文章所有 6 个版本

[PDF] preprints.org

Error analysis prompting enables human-like translation evaluation in large language models: A case study on chatgpt

Q Lu, B Qiu, L Ding, L Xie, D Tao - 2023 - preprints.org

Generative large language models (LLMs), eg, ChatGPT, have demonstrated remarkable
proficiency across several NLP tasks such as machine translation, question answering, text …

被引用次数：112 相关文章所有 5 个版本

[PDF] arxiv.org

Seamless: Multilingual Expressive and Streaming Speech Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arXiv preprint arXiv …, 2023 - arxiv.org

Large-scale automatic speech translation systems today lack key features that help machine-
mediated communication feel seamless when compared to human-to-human dialogue. In …

被引用次数：69 相关文章

[PDF] ieee.org

The impact of artificial intelligence on language translation: a review

YA Mohamed, A Khanan, M Bashir… - Ieee …, 2024 - ieeexplore.ieee.org

In the context of a more linked and globalized society, the significance of proficient cross-
cultural communication has been increasing to a position of utmost importance. Language …

被引用次数：21 相关文章所有 3 个版本

[PDF] arxiv.org

Fast conformer with linearly scalable attention for efficient speech recognition

D Rekesh, NR Koluguri, S Kriman… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Conformer-based models have become the dominant end-to-end architecture for speech
processing tasks. With the objective of enhancing the conformer architecture for efficient …

被引用次数：53 相关文章所有 3 个版本

[PDF] arxiv.org

Unity: Two-pass direct speech-to-speech translation with discrete units

H Inaguma, S Popuri, I Kulikov, PJ Chen… - arXiv preprint arXiv …, 2022 - arxiv.org

Direct speech-to-speech translation (S2ST), in which all components can be optimized
jointly, is advantageous over cascaded approaches to achieve fast inference with a …

被引用次数：35 相关文章所有 6 个版本

[PDF] mlr.press

Translatotron 2: High-quality direct speech-to-speech translation with voice preservation

Y Jia, MT Ramanovich, T Remez… - … on Machine Learning, 2022 - proceedings.mlr.press

We present Translatotron 2, a neural direct speech-to-speech translation model that can be
trained end-to-end. Translatotron 2 consists of a speech encoder, a linguistic decoder, an …

被引用次数：63 相关文章所有 4 个版本

[PDF] mlr.press

Revisiting end-to-end speech-to-text translation from scratch

B Zhang, B Haddow… - … conference on machine …, 2022 - proceedings.mlr.press

Abstract End-to-end (E2E) speech-to-text translation (ST) often depends on pretraining its
encoder and/or decoder using source transcripts via speech recognition or text translation …

被引用次数：30 相关文章所有 7 个版本

[PDF] um.edu.mt

Findings of the iwslt 2023 evaluation campaign

M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli… - 2023 - um.edu.mt

This paper reports on the shared tasks organized by the 20th IWSLT Conference. The
shared tasks address 9 scientific challenges in spoken language translation: simultaneous …

被引用次数：38 相关文章所有 10 个版本

[PDF] mlr.press

Pre-training for speech translation: Ctc meets optimal transport

PH Le, H Gong, C Wang, J Pino… - International …, 2023 - proceedings.mlr.press

The gap between speech and text modalities is a major challenge in speech-to-text
translation (ST). Different methods have been proposed to reduce this gap, but most of them …

被引用次数：19 相关文章所有 12 个版本

高级搜索

QQ 群