[PDF][PDF] End-to-end Speech Recognition Using Lattice-free MMI.

H Hadian, H Sameti, D Povey, S Khudanpur - Interspeech, 2018 - danielpovey.com
We present our work on end-to-end training of acoustic models using the lattice-free
maximum mutual information (LF-MMI) objective function in the context of hidden Markov …

[PDF][PDF] Discriminative training for large vocabulary speech recognition

D Povey - 2005 - researchgate.net
This thesis investigates the use of discriminative criteria for training HMM parameters for
speech recognition, in particular the Maximum Mutual Information (MMI) criterion and a new …

Large scale discriminative training of hidden Markov models for speech recognition

PC Woodland, D Povey - Computer Speech & Language, 2002 - Elsevier
This paper describes, and evaluates on a large scale, the lattice based framework for
discriminative training of large vocabulary speech recognition systems based on Gaussian …

Discriminative learning in sequential pattern recognition

X He, L Deng, W Chou - IEEE Signal Processing Magazine, 2008 - ieeexplore.ieee.org
In this article, we studied the objective functions of MMI, MCE, and MPE/MWE for
discriminative learning in sequential pattern recognition. We presented an approach that …

MMIE training of large vocabulary recognition systems

V Valtchev, JJ Odell, PC Woodland, SJ Young - Speech Communication, 1997 - Elsevier
This paper describes a framework for optimising the structure and parameters of a
continuous density HMM-based large vocabulary recognition system using the Maximum …

Discriminative training for large-vocabulary speech recognition using minimum classification error

E McDermott, TJ Hazen, J Le Roux… - … on Audio, Speech …, 2006 - ieeexplore.ieee.org
The minimum classification error (MCE) framework for discriminative training is a simple and
general formalism for directly optimizing recognition accuracy in pattern recognition …

Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method

S Katagiri, BH Juang, CH Lee - Proceedings of the IEEE, 1998 - ieeexplore.ieee.org
This paper provides a comprehensive introduction to a novel approach to pattern
recognition which is based on the generalized probabilistic descent method (GPD) and its …

Pychain: A fully parallelized pytorch implementation of lf-mmi for end-to-end asr

Y Shao, Y Wang, D Povey, S Khudanpur - arXiv preprint arXiv:2005.09824, 2020 - arxiv.org
We present PyChain, a fully parallelized PyTorch implementation of end-to-end lattice-free
maximum mutual information (LF-MMI) training for the so-called\emph {chain models} in the …

[图书][B] Pattern recognition in speech and language processing

W Chou, BH Juang - 2003 - books.google.com
Over the last 20 years, approaches to designing speech and language processing
algorithms have moved from methods based on linguistics and speech science to data …

Optimizing expected word error rate via sampling for speech recognition

M Shannon - arXiv preprint arXiv:1706.02776, 2017 - arxiv.org
State-level minimum Bayes risk (sMBR) training has become the de facto standard for
sequence-level training of speech recognition acoustic models. It has an elegant formulation …