Findings of the IWSLT 2022 Evaluation Campaign.

A Anastasopoulos, L Barrault, L Bentivogli… - Proceedings of the 19th …, 2022 - cris.fbk.eu
The evaluation campaign of the 19th International Conference on Spoken Language
Translation featured eight shared tasks:(i) Simultaneous speech translation,(ii) Offline …

Attention as a guide for simultaneous speech translation

S Papi, M Negri, M Turchi - arXiv preprint arXiv:2212.07850, 2022 - arxiv.org
The study of the attention mechanism has sparked interest in many fields, such as language
modeling and machine translation. Although its patterns have been exploited to perform …

Over-generation cannot be rewarded: Length-adaptive average lagging for simultaneous speech translation

S Papi, M Gaido, M Negri, M Turchi - arXiv preprint arXiv:2206.05807, 2022 - arxiv.org
Simultaneous speech translation (SimulST) systems aim at generating their output with the
lowest possible latency, which is normally computed in terms of Average Lagging (AL). In …

Direct speech translation for automatic subtitling

S Papi, M Gaido, A Karakanta, M Cettolo… - Transactions of the …, 2023 - direct.mit.edu
Automatic subtitling is the task of automatically translating the speech of audiovisual content
into short pieces of timed text, ie, subtitles and their corresponding timestamps. The …

When good and reproducible results are a giant with feet of clay: The importance of software quality in nlp

S Papi, M Gaido, A Pilzer, M Negri - arXiv preprint arXiv:2303.16166, 2023 - arxiv.org
Despite its crucial role in research experiments, code correctness is often presumed only on
the basis of the perceived quality of results. This assumption comes with the risk of …

Efficient CTC regularization via coarse labels for end-to-end speech translation

B Zhang, B Haddow, R Sennrich - arXiv preprint arXiv:2302.10871, 2023 - arxiv.org
For end-to-end speech translation, regularizing the encoder with the Connectionist
Temporal Classification (CTC) objective using the source transcript or target translation as …

[PDF][PDF] Revamping the SLTev Tool for Evaluation of Spoken Language Translation

M Elizabeth, O Bojar - The Prague Bulletin of Mathematical …, 2024 - ufal.mff.cuni.cz
This article describes recent improvements of SLTev, a tool for automatic evaluation of
machine translation, speech recognition and speech translation systems. The changes …

[PDF][PDF] Manipulating Data Representations for Neural Machine Translation

C Amrhein - 2023 - zora.uzh.ch
In natural language processing, much current research focuses on training larger and larger
models on more and more data. In this thesis, we argue that how data is represented can …

Direct Speech Translation in Constrained Contexts: the Simultaneous and Subtitling Scenarios

S Papi - 2024 - iris.unitn.it
This PhD thesis summarizes the results of a three-year comprehensive investigation into the
dynamic domain of speech translation (ST), with a specific emphasis on application …