Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

Multilingual and code-switching ASR challenges for low resource Indian languages

A Diwan, R Vaideeswaran, S Shah, A Singh… - arXiv preprint arXiv …, 2021 - arxiv.org
Recently, there is increasing interest in multilingual automatic speech recognition (ASR)
where a speech recognition system caters to multiple low resource languages by taking …

Code-switching in automatic speech recognition: The issues and future directions

MB Mustafa, MA Yusoof, HK Khalaf… - Applied Sciences, 2022 - mdpi.com
Code-switching (CS) in spoken language is where the speech has two or more languages
within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research …

Arabic code-switching speech recognition using monolingual data

A Ali, S Chowdhury, A Hussein, Y Hifny - arXiv preprint arXiv:2107.01573, 2021 - arxiv.org
Code-switching in automatic speech recognition (ASR) is an important challenge due to
globalization. Recent research in multilingual ASR shows potential improvement over …

Multilingual graphemic hybrid ASR with massive data augmentation

C Liu, Q Zhang, X Zhang, K Singh, Y Saraf… - arXiv preprint arXiv …, 2019 - arxiv.org
Towards developing high-performing ASR for low-resource languages, approaches to
address the lack of resources are to make use of data from multiple languages, and to …

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great
strides and exhibit excellent performance in general speech recognition. However, there …

[PDF][PDF] Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems.

SR Madikeri, BK Khonglah, S Tong, P Motlicek… - …, 2020 - isca-archive.org
Multilingual acoustic model training combines data from multiple languages to train an
automatic speech recognition system. Such a system is beneficial when training data for a …

Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition

S Feng, M Tu, R Xia, C Huang, Y Wang - arXiv preprint arXiv:2305.11569, 2023 - arxiv.org
We improve low-resource ASR by integrating the ideas of multilingual training and self-
supervised learning. Concretely, we leverage an International Phonetic Alphabet (IPA) …

Injecting text and cross-lingual supervision in few-shot learning from self-supervised models

M Wiesner, D Raj, S Khudanpur - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Self-supervised model pretraining has recently garnered significant interest. However, using
additional resources in fine-tuning these models has received less attention. We …

Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings

C Zhu, K An, H Zheng, Z Ou - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
The use of phonological features (PFs) potentially allows language-specific phones to
remain linked in training, which is highly desirable for information sharing for multilingual …