Continuous speech recognition using multilayer perceptrons with hidden Markov models

X Huang, J Baker, R Reddy - Communications of the ACM, 2014 - dl.acm.org

A historical perspective of speech recognition Page 1 review articles 94 Communications of
the ACM | January 2014 | vol. 57 | no. 1 With the introduction of Apple’s Siri and similar …

Cited by 249 Related articles All 2 versions

Machine learning in automatic speech recognition: A survey

J Padmanabhan… - IETE Technical Review, 2015 - Taylor & Francis

Over the past few decades, there has been tremendous development in machine learning
paradigms used in automatic speech recognition (ASR) for home automation to space …

Cited by 220 Related articles

[PDF] researchgate.net

Automatic speech recognition: a survey

M Malik, MK Malik, K Mehmood… - Multimedia Tools and …, 2021 - Springer

Recently great strides have been made in the field of automatic speech recognition (ASR) by
using various deep learning techniques. In this study, we present a thorough comparison …

Cited by 363 Related articles All 8 versions

[PDF] academia.edu

[BOOK][B] Automatic speech recognition

D Yu, L Deng - 2016 - Springer

Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …

Cited by 1607 Related articles All 9 versions

[PDF] arxiv.org

Listen, attend and spell

W Chan, N Jaitly, QV Le, O Vinyals - arXiv preprint arXiv:1508.01211, 2015 - arxiv.org

We present Listen, Attend and Spell (LAS), a neural network that learns to transcribe speech
utterances to characters. Unlike traditional DNN-HMM models, this model learns all the …

Cited by 622 Related articles All 6 versions

[PDF] academia.edu

Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition

GE Dahl, D Yu, L Deng, A Acero - IEEE Transactions on audio …, 2011 - ieeexplore.ieee.org

We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …

Cited by 4044 Related articles All 19 versions

[PDF] arxiv.org

Hybrid autoregressive transducer (HAT)

E Variani, D Rybach, C Allauzen… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

This paper proposes and evaluates the hybrid autoregressive transducer (HAT) model, a
time-synchronous encoder-decoder model that preserves the modularity of conventional …

Cited by 170 Related articles All 6 versions

[PDF] core.ac.uk

[BOOK][B] Connectionist speech recognition: a hybrid approach

HA Bourlard, N Morgan - 2012 - books.google.com

Connectionist Speech Recognition: A Hybrid Approach describes the theory and
implementation of a method to incorporate neural network approaches into state of the art …

Cited by 2086 Related articles All 13 versions

[PDF] academia.edu

An application of recurrent nets to phone probability estimation

AJ Robinson - IEEE transactions on Neural Networks, 1994 - ieeexplore.ieee.org

This paper presents an application of recurrent networks for phone probability estimation in
large vocabulary speech recognition. The need for efficient exploitation of context …

Cited by 735 Related articles All 12 versions

[PDF] isca-archive.org

[PDF][PDF] Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition.

N Jaitly, P Nguyen, AW Senior, V Vanhoucke - Interspeech, 2012 - isca-archive.org

Abstract The use of Deep Belief Networks (DBN) to pretrain Neural Networks has recently
led to a resurgence in the use of Artificial Neural Network-Hidden Markov Model …

Cited by 368 Related articles All 12 versions
