Automatic speech recognition using advanced deep learning approaches: A survey

H Kheddar, M Hemis, Y Himeur - Information Fusion, 2024 - Elsevier
Recent advancements in deep learning (DL) have posed a significant challenge for
automatic speech recognition (ASR). ASR relies on extensive training datasets, including …

Deep transfer learning for automatic speech recognition: Towards better generalization

H Kheddar, Y Himeur, S Al-Maadeed, A Amira… - Knowledge-Based …, 2023 - Elsevier
Automatic speech recognition (ASR) has recently become an important challenge when
using deep learning (DL). It requires large-scale training datasets and high computational …

End-to-end neural systems for automatic children speech recognition: An empirical study

PG Shivakumar, S Narayanan - Computer Speech & Language, 2022 - Elsevier
A key desiderata for inclusive and accessible speech recognition technology is ensuring its
robust performance to children's speech. Notably, this includes the rapidly advancing neural …

A wav2vec2-based experimental study on self-supervised learning methods to improve child speech recognition

R Jain, A Barcovschi, MY Yiwere, D Bigioi… - IEEE …, 2023 - ieeexplore.ieee.org
Despite recent advancements in deep learning technologies, Child Speech Recognition
remains a challenging task. Current Automatic Speech Recognition (ASR) models require …

Audio augmentation for non-native children's speech recognition through discriminative learning

K Radha, M Bansal - Entropy, 2022 - mdpi.com
Automatic speech recognition (ASR) in children is a rapidly evolving field, as children
become more accustomed to interacting with virtual assistants, such as Amazon Echo …

Comprehensive literature review on children automatic speech recognition system, acoustic linguistic mismatch approaches and challenges

R Sobti, K Guleria, V Kadyan - Multimedia Tools and Applications, 2024 - Springer
Abstract Automatic Speech Recognition (ASR) system for children is as important as for
adults since children are more dependent on these systems nowadays, such as computer …

Using data augmentations and vtln to reduce bias in dutch end-to-end speech recognition systems

T Patel, O Scharenborg - arXiv preprint arXiv:2307.02009, 2023 - arxiv.org
Speech technology has improved greatly for norm speakers, ie, adult native speakers of a
language without speech impediments or strong accents. However, non-norm or diverse …

End-to-end acoustic modelling for phone recognition of young readers

L Gelin, M Daniel, J Pinquier, T Pellegrini - Speech Communication, 2021 - Elsevier
Automatic recognition systems for child speech are lagging behind those dedicated to adult
speech in the race of performance. This phenomenon is due to the high acoustic and …

Exploring data augmentation in bias mitigation against non-native-accented speech

Y Zhang, A Herygers, T Patel, Z Yue… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Automatic speech recognition (ASR) should serve every speaker, not only the majority
“standard” speakers of a language. In order to build inclusive ASR, mitigating the bias …

Analyzing and visualizing deep neural networks for speech recognition with saliency-adjusted neuron activation profiles

A Krug, M Ebrahimzadeh, J Alemann, J Johannsmeier… - Electronics, 2021 - mdpi.com
Deep Learning-based Automatic Speech Recognition (ASR) models are very successful, but
hard to interpret. To gain a better understanding of how Artificial Neural Networks (ANNs) …