State-of-the-art speech recognition with sequence-to-sequence models

CC Chiu, TN Sainath, Y Wu… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS),
subsume the acoustic, pronunciation and language model components of a traditional …

Improving acoustic models in TORGO dysarthric speech database

NM Joy, S Umesh - IEEE Transactions on Neural Systems and …, 2018 - ieeexplore.ieee.org
Assistive speech-based technologies can improve the quality of life for people affected with
dysarthria, a motor speech disorder. In this paper, we explore multiple ways to improve …

[PDF][PDF] Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.

J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong… - Interspeech, 2018 - cs.ou.edu
Dysarthric speech recognition is a highly challenging task. The articulatory motor control
problems associated with neuromotor conditions produce large mismatch against normal …

The artificial intelligence renaissance: deep learning and the road to human-level machine intelligence

KH Tan, BP Lim - APSIPA Transactions on Signal and Information …, 2018 - cambridge.org
In this paper we look at recent advances in artificial intelligence. Decades in the making, a
confluence of several factors in the past few years has culminated in a string of …

Continuous Punjabi speech recognition model based on Kaldi ASR toolkit

J Guglani, AN Mishra - International Journal of Speech Technology, 2018 - Springer
In this paper, continuous Punjabi speech recognition model is presented using Kaldi toolkit.
For speech recognition, the extraction of Mel frequency cepstral coefficients (MFCC) features …

Towards automatic assessment of spontaneous spoken English

Y Wang, MJF Gales, KM Knill, K Kyriakopoulos… - Speech …, 2018 - Elsevier
With increasing global demand for learning English as a second language, there has been
considerable interest in methods of automatic assessment of spoken language proficiency …

[PDF][PDF] A probabilistic formulation of keyword spotting

J Puigcerver - PhD thesis, 2018 - pdfs.semanticscholar.org
This thesis, first defines the goal of Keyword Spotting from a Decision Theory perspective.
Then, the problem is tackled following a probabilistic formulation. More precisely, Keyword …

[PDF][PDF] Investigation on LSTM recurrent n-gram language models for speech recognition

Z Tüske, R Schlüter, H Ney - Interspeech, 2018 - www-i6.informatik.rwth-aachen.de
Recurrent neural networks (NN) with long short-term memory (LSTM) are the current state of
the art to model long term dependencies. However, recent studies indicate that NN …

Turkish speech recognition based on deep neural networks

UA Kımanuka, O Buyuk - Süleyman Demirel Üniversitesi Fen …, 2018 - dergipark.org.tr
In this paper we develop a Turkish speech recognition (SR) system using deep neural
networks and compare it with the previous state-of-the-art traditional Gaussian mixture …

[PDF][PDF] Paired phone-posteriors approach to ESL pronunciation quality assessment

Y Xiao, FK Soong, W Hu - bdl, 2018 - isca-archive.org
This work proposes to incorporate paired phone-posteriors as input features into a neural
net (NN) model for assessing ESL learner's pronunciation quality. In this work, posteriors of …