Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

Very deep multilingual convolutional neural networks for LVCSR

T Sercu, C Puhrsch, B Kingsbury… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are a standard component of many current state-of-
the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However …

Speech recognition and keyword spotting for low-resource languages: Babel project research at cued

MJF Gales, KM Knill, A Ragni… - … workshop on spoken …, 2014 - eprints.whiterose.ac.uk
Recently there has been increased interest in Automatic Speech Recognition (ASR) and
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …

Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk
Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

Sequence-based multi-lingual low resource speech recognition

S Dalmia, R Sanabria, F Metze… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Techniques for multi-lingual and cross-lingual speech recognition can help in low resource
scenarios, to bootstrap systems and enable analysis of new languages and domains. End-to …

Maxout neurons for deep convolutional and LSTM neural networks in speech recognition

M Cai, J Liu - Speech Communication, 2016 - Elsevier
Deep neural networks (DNNs) have achieved great success in acoustic modeling for speech
recognition. However, DNNs with sigmoid neurons may suffer from the vanishing gradient …

Multilingual representations for low resource speech recognition and keyword search

J Cui, B Kingsbury, B Ramabhadran… - 2015 IEEE workshop …, 2015 - ieeexplore.ieee.org
This paper examines the impact of multilingual (ML) acoustic representations on Automatic
Speech Recognition (ASR) and keyword search (KWS) for low resource languages in the …

Transformer-transducers for code-switched speech recognition

S Dalmia, Y Liu, S Ronanki… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
We live in a world where 60% of the population can speak two or more languages fluently.
Members of these communities constantly switch between languages when having a …

Language-adversarial transfer learning for low-resource speech recognition

J Yi, J Tao, Z Wen, Y Bai - IEEE/ACM Transactions on Audio …, 2018 - ieeexplore.ieee.org
The acoustic model trained using the knowledge from the shared hidden layer (SHL) model
outperforms the model trained only by using the target language, especially under low …