Flexible speaker adaptation using maximum likelihood linear regression

P Dreuw, D Rybach, G Heigold, H Ney - Guide to OCR for Arabic scripts, 2012 - Springer

We present a novel large vocabulary OCR system, which implements a confidence-and
margin-based discriminative training approach for model adaptation of an HMM-based …

被引用次数：38 相关文章所有 6 个版本

[PDF] upm.es

A speech interface for air traffic control terminals

J Ferreiros, JM Pardo, R De Córdoba… - Aerospace Science and …, 2012 - Elsevier

Several issues concerning the current use of speech interfaces are discussed and the
design and development of a speech interface that enables air traffic controllers to command …

被引用次数：22 相关文章所有 18 个版本

[PDF] googleapis.com

Application development with unified programming models

K Wang - US Patent 8,266,586, 2012 - Google Patents

A unified programming environment allows application developers to work with declarative,
procedural and service model based logic. In one aspect, instructions on a computer …

被引用次数：41 相关文章所有 4 个版本

Phase AutoCorrelation (PAC) features for noise robust speech recognition

S Ikbal, H Misra, H Hermansky, M Magimai-Doss - Speech Communication, 2012 - Elsevier

In this paper, we introduce a new class of noise robust features derived from an alternative
measure of autocorrelation representing the phase variation of speech signal frame over …

被引用次数：16 相关文章所有 5 个版本

[PDF] marquette.edu

Bayesian speaker adaptation based on a new hierarchical probabilistic model

WL Zhang, WQ Zhang, BC Li, D Qu… - IEEE transactions on …, 2012 - ieeexplore.ieee.org

In this paper, a new hierarchical Bayesian speaker adaptation method called HMAP is
proposed that combines the advantages of three conventional algorithms, maximum a …

被引用次数：9 相关文章所有 14 个版本

[PDF] researchgate.net

[PDF][PDF] Uncertainty-based learning of Gaussian mixture models from noisy data

A Ozerov, M Lagrange, E Vincent - 2012 - researchgate.net

We consider the problem of Gaussian mixture model (GMM)-based classification of noisy
data, where the uncertainty over the data is given by a Gaussian distribution. While this …

被引用次数：7 相关文章所有 5 个版本

[PDF] titech.ac.jp

[PDF][PDF] HMM based continuous EOG recognition for eye-input speech interface

F Fang, T Shinozaki, Y Horiuchi… - … Conference of the …, 2012 - t2r2-inside.star.titech.ac.jp

To provide an efficient means of communication for those who cannot move muscles of the
whole body except eyes due to amyotrophic lateral sclerosis (ALS), we are developing a …

被引用次数：6 相关文章所有 8 个版本

[PDF] hal.science

New insights into hierarchical clustering and linguistic normalization for speaker diarization

S Bozonnet - 2012 - pastel.hal.science

The ever-expanding volume of available audio and multimedia data has elevated
technologies related to content indexing and structuring to the forefront of research. Speaker …

被引用次数：4 相关文章所有 7 个版本

[PDF] illinois.edu

[图书][B] Acoustic model adaptation for recognition of dysarthric speech

HV Sharma - 2012 - search.proquest.com

Speech production errors characteristic of dysarthria are chiefly responsible for the low
accuracy of automatic speech recognition (ASR) when used by people diagnosed with the …

被引用次数：4 相关文章所有 2 个版本

[PDF] eurecom.fr

[PDF][PDF] New Insights into Hierarchical Clustering and Linguistic Normalization for Speaker Diarization

MJF BONASTRE - 2012 - eurecom.fr

The ever-expanding volume of available audio and multimedia data has elevated
technologies related to content indexing and structuring to the forefront of research. Speaker …

高级搜索

QQ 群