Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition

GE Dahl, D Yu, L Deng, A Acero - IEEE Transactions on audio …, 2011 - ieeexplore.ieee.org
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …

Real-time classification and sensor fusion with a spiking deep belief network

P O'Connor, D Neil, SC Liu, T Delbruck… - Frontiers in …, 2013 - frontiersin.org
Deep Belief Networks (DBNs) have recently shown impressive performance on a broad
range of classification problems. Their generative properties allow better understanding of …

An introduction to voice search

YY Wang, D Yu, YC Ju, A Acero - IEEE Signal Processing …, 2008 - ieeexplore.ieee.org
Voice search is the technology underlying many spoken dialog systems (SDSs) that provide
users with the information they request with a spoken query. The information normally exists …

[PDF][PDF] Improved bottleneck features using pretrained deep neural networks

D Yu, ML Seltzer - Twelfth annual conference of the …, 2011 - jackyguo624.github.io
Bottleneck features have been shown to be effective in improving the accuracy of automatic
speech recognition (ASR) systems. Conventionally, bottleneck features are extracted from a …

[PDF][PDF] Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition

D Yu, L Deng, G Dahl - Proc. NIPS Workshop on Deep Learning and …, 2010 - microsoft.com
Recently, deep learning techniques have been successfully applied to automatic speech
recognition tasks--first to phonetic recognition with context-independent deep belief network …

Large vocabulary continuous speech recognition with context-dependent DBN-HMMs

GE Dahl, D Yu, L Deng, A Acero - 2011 IEEE international …, 2011 - ieeexplore.ieee.org
The context-independent deep belief network (DBN) hidden Markov model (HMM) hybrid
architecture has recently achieved promising results for phone recognition. In this work, we …

Searching by talking: Analysis of voice queries on mobile web search

I Guy - Proceedings of the 39th International ACM SIGIR …, 2016 - dl.acm.org
The growing popularity of mobile search and the advancement in voice recognition
technologies have opened the door for web search users to speak their queries, rather than …

Long short-term memory recurrent neural network-based acoustic model using connectionist temporal classification on a large-scale training corpus

D Lee, M Lim, H Park, Y Kang, JS Park… - China …, 2017 - ieeexplore.ieee.org
A Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN) has driven
tremendous improvements on an acoustic model based on Gaussian Mixture Model (GMM) …

A segmental CRF approach to large vocabulary continuous speech recognition

G Zweig, P Nguyen - 2009 IEEE Workshop on Automatic …, 2009 - ieeexplore.ieee.org
This paper proposes a segmental conditional random field framework for large vocabulary
continuous speech recognition. Fundamental to this approach is the use of acoustic …

Calibration of confidence measures in speech recognition

D Yu, J Li, L Deng - IEEE Transactions on Audio, Speech, and …, 2011 - ieeexplore.ieee.org
Most speech recognition applications in use today rely heavily on confidence measure for
making optimal decisions. In this paper, we aim to answer the question: what can be done to …