Multi-basis adaptive neural network for rapid adaptation in speech recognition

C Wu, MJF Gales - 2015 IEEE International Conference on …, 2015 - ieeexplore.ieee.org
Recent progress in acoustic modeling with deep neural network has significantly improved
the performance of automatic speech recognition systems. However, it remains as an open …

[PDF][PDF] An improved DNN-based approach to mispronunciation detection and diagnosis of L2 learners' speech.

W Hu, Y Qian, FK Soong - SLaTE, 2015 - researchgate.net
We extend the Goodness of Pronunciation (GOP) algorithm from the conventional GMM-
HMM to DNN-HMM and further optimize the GOP measure toward L2 language learners' …

Two efficient lattice rescoring methods using recurrent neural network language models

X Liu, X Chen, Y Wang, MJF Gales… - … /ACM Transactions on …, 2016 - ieeexplore.ieee.org
An important part of the language modelling problem for automatic speech recognition
(ASR) systems, and many other related applications, is to appropriately model long-distance …

[图书][B] Computers in the human interaction loop

A Waibel, R Stiefelhagen, R Carlson, J Casas… - 2010 - Springer
It is a common experience in our modern world, for us humans to be overwhelmed by the
complexities of technological artifacts around us, and by the attention they demand. While …

Spectro-temporal deep features for disordered speech assessment and recognition

M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date.
Sources of variability commonly found in normal speech including accent, age or gender …

Multiple proposals for continuous arabic sign language recognition

M Hassan, K Assaleh, T Shanableh - Sensing and Imaging, 2019 - Springer
The deaf community relies on sign language as the primary means of communication. For
the millions of people around the world who suffer from hearing loss, interaction with hearing …

Improving broadcast news transcription by lightly supervised discriminative training

HY Chan, P Woodland - 2004 IEEE International Conference …, 2004 - ieeexplore.ieee.org
We present our experiments on lightly supervised discriminative training with large amounts
of broadcast news data for which only closed caption transcriptions are available (TDT data) …

Towards automatic assessment of spontaneous spoken English

Y Wang, MJF Gales, KM Knill, K Kyriakopoulos… - Speech …, 2018 - Elsevier
With increasing global demand for learning English as a second language, there has been
considerable interest in methods of automatic assessment of spoken language proficiency …

[PDF][PDF] Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages.

Z Tüske, P Golik, D Nolden, R Schlüter, H Ney - Interspeech, 2014 - academia.edu
This paper presents the progress of acoustic models for lowresourced languages
(Assamese, Bengali, Haitian Creole, Lao, Zulu) developed within the second evaluation …

GFCC based discriminatively trained noise robust continuous ASR system for Hindi language

M Dua, RK Aggarwal, M Biswas - Journal of Ambient Intelligence and …, 2019 - Springer
A statistically designed Automatic Speech Recognition (ASR) system extracts features from
speech signals using feature extraction methods, links the extracted features with the …