Minimum phone error and I-smoothing for improved discriminative training

C Wu, MJF Gales - 2015 IEEE International Conference on …, 2015 - ieeexplore.ieee.org

Recent progress in acoustic modeling with deep neural network has significantly improved
the performance of automatic speech recognition systems. However, it remains as an open …

被引用次数：81 相关文章所有 6 个版本

[PDF] researchgate.net

[PDF][PDF] An improved DNN-based approach to mispronunciation detection and diagnosis of L2 learners' speech.

W Hu, Y Qian, FK Soong - SLaTE, 2015 - researchgate.net

We extend the Goodness of Pronunciation (GOP) algorithm from the conventional GMM-
HMM to DNN-HMM and further optimize the GOP measure toward L2 language learners' …

被引用次数：64 相关文章所有 2 个版本

[PDF] cam.ac.uk

Two efficient lattice rescoring methods using recurrent neural network language models

X Liu, X Chen, Y Wang, MJF Gales… - … /ACM Transactions on …, 2016 - ieeexplore.ieee.org

An important part of the language modelling problem for automatic speech recognition
(ASR) systems, and many other related applications, is to appropriately model long-distance …

被引用次数：65 相关文章所有 6 个版本

[PDF] researchgate.net

[图书][B] Computers in the human interaction loop

A Waibel, R Stiefelhagen, R Carlson, J Casas… - 2010 - Springer

It is a common experience in our modern world, for us humans to be overwhelmed by the
complexities of technological artifacts around us, and by the attention they demand. While …

被引用次数：117 相关文章所有 20 个版本

[PDF] arxiv.org

Spectro-temporal deep features for disordered speech assessment and recognition

M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin… - arXiv preprint arXiv …, 2022 - arxiv.org

Automatic recognition of disordered speech remains a highly challenging task to date.
Sources of variability commonly found in normal speech including accent, age or gender …

被引用次数：21 相关文章所有 8 个版本

[PDF] aus.edu

Multiple proposals for continuous arabic sign language recognition

M Hassan, K Assaleh, T Shanableh - Sensing and Imaging, 2019 - Springer

The deaf community relies on sign language as the primary means of communication. For
the millions of people around the world who suffer from hearing loss, interaction with hearing …

被引用次数：48 相关文章所有 5 个版本

[PDF] researchgate.net

Improving broadcast news transcription by lightly supervised discriminative training

HY Chan, P Woodland - 2004 IEEE International Conference …, 2004 - ieeexplore.ieee.org

We present our experiments on lightly supervised discriminative training with large amounts
of broadcast news data for which only closed caption transcriptions are available (TDT data) …

被引用次数：114 相关文章所有 13 个版本

[PDF] cam.ac.uk

Towards automatic assessment of spontaneous spoken English

Y Wang, MJF Gales, KM Knill, K Kyriakopoulos… - Speech …, 2018 - Elsevier

With increasing global demand for learning English as a second language, there has been
considerable interest in methods of automatic assessment of spoken language proficiency …

被引用次数：44 相关文章所有 5 个版本

[PDF] academia.edu

[PDF][PDF] Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages.

Z Tüske, P Golik, D Nolden, R Schlüter, H Ney - Interspeech, 2014 - academia.edu

This paper presents the progress of acoustic models for lowresourced languages
(Assamese, Bengali, Haitian Creole, Lao, Zulu) developed within the second evaluation …

被引用次数：68 相关文章所有 8 个版本

GFCC based discriminatively trained noise robust continuous ASR system for Hindi language

M Dua, RK Aggarwal, M Biswas - Journal of Ambient Intelligence and …, 2019 - Springer

A statistically designed Automatic Speech Recognition (ASR) system extracts features from
speech signals using feature extraction methods, links the extracted features with the …

被引用次数：47 相关文章所有 2 个版本

高级搜索

QQ 群