RWTH OCR: A large vocabulary optical character recognition system for Arabic scripts

P Dreuw, D Rybach, G Heigold, H Ney - Guide to OCR for Arabic scripts, 2012 - Springer
We present a novel large vocabulary OCR system, which implements a confidence- and
margin-based discriminative training approach for model adaptation of an HMM-based …

A speech interface for air traffic control terminals

J Ferreiros, JM Pardo, R De Córdoba… - Aerospace Science and …, 2012 - Elsevier
Several issues concerning the current use of speech interfaces are discussed and the
design and development of a speech interface that enables air traffic controllers to command …

Application development with unified programming models

K Wang - US Patent 8,266,586, 2012 - Google Patents
A unified programming environment allows application developers to work with declarative,
procedural and service model based logic. In one aspect, instructions on a computer …

Phase AutoCorrelation (PAC) features for noise robust speech recognition

S Ikbal, H Misra, H Hermansky, M Magimai-Doss - Speech Communication, 2012 - Elsevier
In this paper, we introduce a new class of noise robust features derived from an alternative
measure of autocorrelation representing the phase variation of speech signal frame over …

Bayesian speaker adaptation based on a new hierarchical probabilistic model

WL Zhang, WQ Zhang, BC Li, D Qu… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
In this paper, a new hierarchical Bayesian speaker adaptation method called HMAP is
proposed that combines the advantages of three conventional algorithms, maximum a …

[PDF] Uncertainty-based learning of Gaussian mixture models from noisy data

A Ozerov, M Lagrange, E Vincent - 2012 - researchgate.net
We consider the problem of Gaussian mixture model (GMM)-based classification of noisy
data, where the uncertainty over the data is given by a Gaussian distribution. While this …

[PDF] HMM based continuous EOG recognition for eye-input speech interface

F Fang, T Shinozaki, Y Horiuchi… - … Conference of the …, 2012 - t2r2-inside.star.titech.ac.jp
To provide an efficient means of communication for those who cannot move muscles of the
whole body except eyes due to amyotrophic lateral sclerosis (ALS), we are developing a …

New insights into hierarchical clustering and linguistic normalization for speaker diarization

S Bozonnet - 2012 - pastel.hal.science
The ever-expanding volume of available audio and multimedia data has elevated
technologies related to content indexing and structuring to the forefront of research. Speaker …

[BOOK][B] Acoustic model adaptation for recognition of dysarthric speech

HV Sharma - 2012 - search.proquest.com
Speech production errors characteristic of dysarthria are chiefly responsible for the low
accuracy of automatic speech recognition (ASR) when used by people diagnosed with the …

[PDF] New Insights into Hierarchical Clustering and Linguistic Normalization for Speaker Diarization

MJF BONASTRE - 2012 - eurecom.fr
The ever-expanding volume of available audio and multimedia data has elevated
technologies related to content indexing and structuring to the forefront of research. Speaker …