Automatic speech recognition and speech variability: A review

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier
Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arXiv preprint arXiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

wav2vec: Unsupervised pre-training for speech recognition

S Schneider, A Baevski, R Collobert, M Auli - arXiv preprint arXiv …, 2019 - arxiv.org
We explore unsupervised pre-training for speech recognition by learning representations of
raw audio. wav2vec is trained on large amounts of unlabeled audio data and the resulting …

Rethinking evaluation in asr: Are our models robust enough?

T Likhomanenko, Q Xu, V Pratap, P Tomasello… - arXiv preprint arXiv …, 2020 - arxiv.org
Is pushing numbers on a single benchmark valuable in automatic speech recognition?
Research results in acoustic modeling are typically evaluated based on performance on a …

[PDF][PDF] Tree-based state tying for high accuracy modelling

SJ Young, JJ Odell, PC Woodland - … Technology: Proceedings of a …, 1994 - aclanthology.org
The key problem to be faced when building a HMM-based continuous speech recogniser is
maintaining the balance between model complexity and available training data. For large …

End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition

D Palaz, M Magimai-Doss, R Collobert - Speech Communication, 2019 - Elsevier
In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …

System and method for natural language processing of query answers

IM Bennett - US Patent 7,702,508, 2010 - Google Patents
Candidate answers responsive to a user query are analyzed using a natural language
engine to determine appropriate answers from an electronic database. The system and meth …

Query engine for processing voice based queries including semantic decoding

IM Bennett - US Patent 7,725,307, 2010 - Google Patents
Primary Examiner Martin Lerner Related US Application Data(74) Attorney, Agent, or Firm
PatentBest; Andrew McAleavey (63) Continuation-in-part of application No. 09/439,145, filed …

[PDF][PDF] The HTK hidden Markov model toolkit: Design and philosophy

SJ Young, S Young - 1993 - researchgate.net
HTK is an integrated suite of software tools for building and manipulating continuous density
Hidden Markov Models (HMMs). It consists of a set of library modules and a set of more than …

Distributed real time speech recognition system

IM Bennett, BR Babu, K Morkhandikar… - US Patent …, 2015 - Google Patents
(57) ABSTRACT A real-time system incorporating speech recognition and lin guistic
processing for recognizing a spoken query by a user and distributed between client and …