Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Acoustic-based sensing and applications: A survey

Y Bai, L Lu, J Cheng, J Liu, Y Chen, J Yu - Computer Networks, 2020 - Elsevier
With advancements of wireless and sensing technologies, recent studies have
demonstrated technical feasibility and effectiveness of using acoustic signals for sensing. In …

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices

T Yoshioka, N Ito, M Delcroix, A Ogawa… - … IEEE Workshop on …, 2015 - ieeexplore.ieee.org
CHiME-3 is a research community challenge organised in 2015 to evaluate speech
recognition systems for mobile multi-microphone devices used in noisy daily environments …

Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models

P Swietojanski, S Renals - 2014 IEEE Spoken Language …, 2014 - ieeexplore.ieee.org
This paper proposes a simple yet effective model-based neural network speaker adaptation
technique that learns speaker-specific hidden unit contributions given adaptation data …

Learning hidden unit contributions for unsupervised acoustic model adaptation

P Swietojanski, J Li, S Renals - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This work presents a broad study on the adaptation of neural network acoustic models by
means of learning hidden unit contributions (LHUC)-a method that linearly re-combines …

Semi-supervised training of deep neural networks

K Veselý, M Hannemann… - 2013 IEEE Workshop on …, 2013 - ieeexplore.ieee.org
In this paper we search for an optimal strategy for semi-supervised Deep Neural Network
(DNN) training. We assume that a small part of the data is transcribed, while the majority of …

Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition

S Hu, X Xie, Z Jin, M Geng, Y Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered and elderly speech remains a highly challenging task to
date due to the difficulty in collecting such data in large quantities. This paper explores a …

Exploring attention mechanisms based on summary information for end-to-end automatic speech recognition

J Xue, T Zheng, J Han - Neurocomputing, 2021 - Elsevier
Recent studies have confirmed that attention mechanisms with location constraint strategy
are helpful to reduce the misrecognition caused by incorrect alignments in attention-based …

Towards automatic assessment of spontaneous spoken English

Y Wang, MJF Gales, KM Knill, K Kyriakopoulos… - Speech …, 2018 - Elsevier
With increasing global demand for learning English as a second language, there has been
considerable interest in methods of automatic assessment of spoken language proficiency …

[HTML][HTML] Robust i-vector based adaptation of DNN acoustic model for speech recognition

S Garimella, A Mandal, N Ström, B Hoffmeister… - 2015 - amazon.science
In the past, conventional i-vectors based on a Universal Background Model (UBM) have
been successfully used as input features to adapt a Deep Neural Network (DNN) Acoustic …