A historical perspective of speech recognition

X Huang, J Baker, R Reddy - Communications of the ACM, 2014 - dl.acm.org
A historical perspective of speech recognition. Review articles, p. 94, Communications of
the ACM, January 2014, vol. 57, no. 1. With the introduction of Apple’s Siri and similar …

Machine learning in automatic speech recognition: A survey

J Padmanabhan… - IETE Technical Review, 2015 - Taylor & Francis
Over the past few decades, there has been tremendous development in machine learning
paradigms used in automatic speech recognition (ASR) for home automation to space …

Automatic speech recognition: a survey

M Malik, MK Malik, K Mehmood… - Multimedia Tools and …, 2021 - Springer
Recently great strides have been made in the field of automatic speech recognition (ASR) by
using various deep learning techniques. In this study, we present a thorough comparison …

[BOOK][B] Automatic speech recognition

D Yu, L Deng - 2016 - Springer
Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …

Listen, attend and spell

W Chan, N Jaitly, QV Le, O Vinyals - arXiv preprint arXiv:1508.01211, 2015 - arxiv.org
We present Listen, Attend and Spell (LAS), a neural network that learns to transcribe speech
utterances to characters. Unlike traditional DNN-HMM models, this model learns all the …

Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition

GE Dahl, D Yu, L Deng, A Acero - IEEE Transactions on audio …, 2011 - ieeexplore.ieee.org
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …

Hybrid autoregressive transducer (HAT)

E Variani, D Rybach, C Allauzen… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
This paper proposes and evaluates the hybrid autoregressive transducer (HAT) model, a
time-synchronous encoder-decoder model that preserves the modularity of conventional …

[BOOK][B] Connectionist speech recognition: a hybrid approach

HA Bourlard, N Morgan - 2012 - books.google.com
Connectionist Speech Recognition: A Hybrid Approach describes the theory and
implementation of a method to incorporate neural network approaches into state of the art …

An application of recurrent nets to phone probability estimation

AJ Robinson - IEEE Transactions on Neural Networks, 1994 - ieeexplore.ieee.org
This paper presents an application of recurrent networks for phone probability estimation in
large vocabulary speech recognition. The need for efficient exploitation of context …

[PDF][PDF] Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition.

N Jaitly, P Nguyen, AW Senior, V Vanhoucke - Interspeech, 2012 - isca-archive.org
The use of Deep Belief Networks (DBN) to pretrain Neural Networks has recently
led to a resurgence in the use of Artificial Neural Network-Hidden Markov Model …