Large-scale multilingual speech recognition with a streaming end-to-end model

A Kannan, A Datta, TN Sainath, E Weinstein… - arXiv preprint arXiv …, 2019 - arxiv.org
Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic
speech recognition (ASR) coverage of the world's languages. They have shown …

Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

Code-switching in automatic speech recognition: The issues and future directions

MB Mustafa, MA Yusoof, HK Khalaf… - Applied Sciences, 2022 - mdpi.com
Code-switching (CS) in spoken language is where the speech has two or more languages
within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research …

Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling

J Cho, MK Baskar, R Li, M Wiesner… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Sequence-to-sequence (seq2seq) approach for low-resource ASR is a relatively new
direction in speech research. The approach benefits by performing model training without …

Meta learning for end-to-end low-resource speech recognition

JY Hsu, YJ Chen, H Lee - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
In this paper, we proposed to apply meta learning approach for low-resource automatic
speech recognition (ASR). We formulated ASR for different languages as different tasks, and …

A survey of multilingual models for automatic speech recognition

H Yadav, S Sitaram - arXiv preprint arXiv:2202.12576, 2022 - arxiv.org
Although Automatic Speech Recognition (ASR) systems have achieved human-like
performance for a few languages, the majority of the world's languages do not have usable …

Transformer-transducers for code-switched speech recognition

S Dalmia, Y Liu, S Ronanki… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
We live in a world where 60% of the population can speak two or more languages fluently.
Members of these communities constantly switch between languages when having a …

[PDF][PDF] Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.

S Khare, AR Mittal, A Diwan, S Sarawagi, P Jyothi… - Interspeech, 2021 - isca-archive.org
Cross-lingual transfer of knowledge from high-resource languages to low-resource
languages is an important research problem in automatic speech recognition (ASR). We …

Adversarial meta sampling for multilingual low-resource speech recognition

Y Xiao, K Gong, P Zhou, G Zheng, X Liang… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Low-resource automatic speech recognition (ASR) is challenging, as the low-resource target
language data cannot well train an ASR model. To solve this issue, meta-learning …

Language-agnostic multilingual modeling

A Datta, B Ramabhadran, J Emond… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Multilingual Automated Speech Recognition (ASR) systems allow for the joint training of
data-rich and data-scarce languages in a single model. This enables data and parameter …