Minimum phone error and I-smoothing for improved discriminative training

CC Chiu, TN Sainath, Y Wu… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS),
subsume the acoustic, pronunciation and language model components of a traditional …

被引用次数：1424 相关文章所有 10 个版本

[PDF] isca-archive.org

Improving acoustic models in TORGO dysarthric speech database

NM Joy, S Umesh - IEEE Transactions on Neural Systems and …, 2018 - ieeexplore.ieee.org

Assistive speech-based technologies can improve the quality of life for people affected with
dysarthria, a motor speech disorder. In this paper, we explore multiple ways to improve …

被引用次数：74 相关文章所有 9 个版本

[PDF] ou.edu

[PDF][PDF] Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.

J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong… - Interspeech, 2018 - cs.ou.edu

Dysarthric speech recognition is a highly challenging task. The articulatory motor control
problems associated with neuromotor conditions produce large mismatch against normal …

被引用次数：55 相关文章所有 9 个版本

[PDF] cambridge.org

The artificial intelligence renaissance: deep learning and the road to human-level machine intelligence

KH Tan, BP Lim - APSIPA Transactions on Signal and Information …, 2018 - cambridge.org

In this paper we look at recent advances in artificial intelligence. Decades in the making, a
confluence of several factors in the past few years has culminated in a string of …

被引用次数：59 相关文章所有 2 个版本

Continuous Punjabi speech recognition model based on Kaldi ASR toolkit

J Guglani, AN Mishra - International Journal of Speech Technology, 2018 - Springer

In this paper, continuous Punjabi speech recognition model is presented using Kaldi toolkit.
For speech recognition, the extraction of Mel frequency cepstral coefficients (MFCC) features …

被引用次数：56 相关文章所有 2 个版本

[PDF] cam.ac.uk

Towards automatic assessment of spontaneous spoken English

Y Wang, MJF Gales, KM Knill, K Kyriakopoulos… - Speech …, 2018 - Elsevier

With increasing global demand for learning English as a second language, there has been
considerable interest in methods of automatic assessment of spoken language proficiency …

被引用次数：44 相关文章所有 5 个版本

[PDF] semanticscholar.org

[PDF][PDF] A probabilistic formulation of keyword spotting

J Puigcerver - PhD thesis, 2018 - pdfs.semanticscholar.org

This thesis, first defines the goal of Keyword Spotting from a Decision Theory perspective.
Then, the problem is tackled following a probabilistic formulation. More precisely, Keyword …

被引用次数：42 相关文章

[PDF] rwth-aachen.de

[PDF][PDF] Investigation on LSTM recurrent n-gram language models for speech recognition

Z Tüske, R Schlüter, H Ney - Interspeech, 2018 - www-i6.informatik.rwth-aachen.de

Recurrent neural networks (NN) with long short-term memory (LSTM) are the current state of
the art to model long term dependencies. However, recent studies indicate that NN …

被引用次数：29 相关文章所有 10 个版本

[PDF] dergipark.org.tr

Turkish speech recognition based on deep neural networks

UA Kımanuka, O Buyuk - Süleyman Demirel Üniversitesi Fen …, 2018 - dergipark.org.tr

In this paper we develop a Turkish speech recognition (SR) system using deep neural
networks and compare it with the previous state-of-the-art traditional Gaussian mixture …

被引用次数：21 相关文章所有 9 个版本

[PDF] isca-archive.org

[PDF][PDF] Paired phone-posteriors approach to ESL pronunciation quality assessment

Y Xiao, FK Soong, W Hu - bdl, 2018 - isca-archive.org

This work proposes to incorporate paired phone-posteriors as input features into a neural
net (NN) model for assessing ESL learner's pronunciation quality. In this work, posteriors of …

被引用次数：19 相关文章所有 4 个版本

高级搜索

QQ 群