A survey on automatic speech recognition systems for Portuguese language and its variations

TA de Lima, M Da Costa-Abreu - Computer Speech & Language, 2020 - Elsevier
Communication has been an essential part of being human and living in society. There are
several different languages and variations of them, so you can speak English in one place …

[PDF][PDF] End-to-End Accented Speech Recognition.

T Viglino, P Motlicek, M Cernak - Interspeech, 2019 - isca-archive.org
Correct pronunciation is known to be the most difficult part to acquire for (native or non-
native) language learners. The accented speech is thus more variable, and standard …

Pseudo-labeling for massively multilingual speech recognition

L Lugosch, T Likhomanenko… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Semi-supervised learning through pseudo-labeling has become a staple of state-of-the-art
monolingual speech recognition systems. In this work, we extend pseudo-labeling to …

Transfer learning of language-independent end-to-end ASR with language model fusion

H Inaguma, J Cho, MK Baskar… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
This work explores better adaptation methods to low-resource languages using an external
language model (LM) under the framework of transfer learning. We first build a language …

Cross-lingual self-training to learn multilingual representation for low-resource speech recognition

ZQ Zhang, Y Song, MH Wu, X Fang… - Circuits, Systems, and …, 2022 - Springer
Abstract Representation learning or pre-training has shown promising performance for low-
resource speech recognition which suffers from the data shortage. Recently, self-supervised …

Towards robust mispronunciation detection and diagnosis for L2 English learners with accent-modulating methods

SWF Jiang, BC Yan, TH Lo, FA Chao… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
With the acceleration of globalization, more and more people are willing or required to learn
second languages (L2). One of the major remaining challenges facing current …

Mutual-learning sequence-level knowledge distillation for automatic speech recognition

Z Li, Y Ming, L Yang, JH Xue - Neurocomputing, 2021 - Elsevier
Automatic speech recognition (ASR) is a crucial technology for man-machine interaction.
End-to-end models have been studied recently in deep learning for ASR. However, these …

Discovering phonetic inventories with crosslingual automatic speech recognition

P Żelasko, S Feng, LM Velazquez, A Abavisani… - Computer Speech & …, 2022 - Elsevier
The high cost of data acquisition makes Automatic Speech Recognition (ASR) model
training problematic for most existing languages, including languages that do not even have …

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

Optimizing data usage for low-resource speech recognition

Y Qian, Z Zhou - IEEE/ACM Transactions on Audio, Speech …, 2022 - ieeexplore.ieee.org
Automatic speech recognition has made huge progress recently. However, the current
modeling strategy still suffers a large performance degradation when facing the low …