Speech recognition in noisy environments: A survey

Y Gong - Speech communication, 1995 - Elsevier
The performance levels of most current speech recognizers degrade significantly when
environmental noise occurs during use. Such performance degradation is mainly caused by …

Continuous probabilistic transform for voice conversion

Y Stylianou, O Cappé… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org
Voice conversion, as considered in this paper, is defined as modifying the speech signal of
one speaker (source speaker) so that it sounds as if it had been pronounced by a different …

Nonparallel training for voice conversion based on a parameter adaptation approach

A Mouchtaris, J Van der Spiegel… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
The objective of voice conversion algorithms is to modify the speech by a particular source
speaker so that it sounds as if spoken by a different target speaker. Current conversion …

A system for voice conversion based on probabilistic classification and a harmonic plus noise model

Y Stylianou, O Cappe - … and Signal Processing, ICASSP'98 (Cat …, 1998 - ieeexplore.ieee.org
Voice conversion is defined as modifying the speech signal of one speaker (source speaker)
so that it sounds as if it had been pronounced by a different speaker (target speaker). This …

[图书][B] Speech processing in mobile environments

KS Rao, AK Vuppala - 2014 - Springer
Robust speech systems in mobile environment have gained a special interest in recent
years in order to enable access to remote voice-activated services. In this context, three …

Stereo-based stochastic mapping for robust speech recognition

M Afify, X Cui, Y Gao - IEEE transactions on audio, speech, and …, 2009 - ieeexplore.ieee.org
We present a stochastic mapping technique for robust speech recognition that uses stereo
data. The idea is based on constructing a Gaussian mixture model for the joint distribution of …

A study of variable-parameter Gaussian mixture hidden Markov modeling for noisy speech recognition

X Cui, Y Gong - IEEE transactions on audio, speech, and …, 2007 - ieeexplore.ieee.org
To improve recognition performance in noisy environments, multicondition training is usually
applied in which speech signals corrupted by a variety of noise are used in acoustic model …

Automatic word recognition in cars

CE Mokbel, GFA Chollet - IEEE Transactions on Speech and …, 1995 - ieeexplore.ieee.org
The paper compares, on a database recorded in a car, a number of signal analysis and
speech enhancement techniques as well as some approaches to adapt speech recognition …

A novel framework and training algorithm for variable-parameter hidden Markov models

D Yu, L Deng, Y Gong, A Acero - IEEE transactions on audio …, 2009 - ieeexplore.ieee.org
We propose a new framework and the associated maximum-likelihood and discriminative
training algorithms for the variable-parameter hidden Markov model (VPHMM) whose mean …

Towards improving ASR robustness for PSN and GSM telephone applications

C Mokbel, L Mauuary, L Karray, D Jouvet, J Monné… - Speech …, 1997 - Elsevier
In real-life applications, errors in the speech recognition system are mainly due to inefficient
detection of speech segments, unreliable rejection of Out-Of-Vocabulary (OOV) words, and …