Robust speech recognition based on dereverberation parameter optimization using acoustic model likelihood

R Gomez, T Kawahara - IEEE transactions on audio, speech …, 2010 - ieeexplore.ieee.org
Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most
dereverberation techniques address this problem through signal processing and enhances …

Multi-party human-robot interaction with distant-talking speech recognition

R Gomez, T Kawahara, K Nakamura… - Proceedings of the …, 2012 - dl.acm.org
Speech is one of the most natural medium for human communication, which makes it vital to
human-robot interaction. In real environments where robots are deployed, distant-talking …

Optimizing spectral subtraction and wiener filtering for robust speech recognition in reverberant and noisy conditions

R Gomez, T Kawahara - 2010 IEEE International Conference …, 2010 - ieeexplore.ieee.org
Speech enhancement is a common approach to address the effects of degradation due to
noise and channel contamination. This approach is intended to suppress unwanted signal …

Robustness over time-varying channels in DNN-HMM ASR based human-robot interaction

J Novoa, J Wuth, JP Escudero, J Fredes, R Mahu… - …, 2017 - isca-archive.org
This paper addresses the problem of time-varying channels in speech-recognition-based
human-robot interaction using Locally-Normalized Filter-Bank features (LNFB), and training …

Optimization of dereverberation parameters based on likelihood of speech recognizer

R Gomez, T Kawahara - Tenth Annual Conference of the …, 2009 - isca-archive.org
Speech recognition under reverberant condition is a difficult task. Most dereverberation
techniques used to address this problem enhance the reverberant waveform independent …

Dereverberation robust to speaker's azimuthal orientation in multi-channel human-robot communication

R Gomez, K Nakamura… - 2013 IEEE/RSJ …, 2013 - ieeexplore.ieee.org
The acoustical dynamics of reverberation in an enclosed environment poses a problem to
human-robot communication. Any change in the azimuthal orientation of the speaker …

Utilizing visual cues in robot audition for sound source discrimination in speech-based human-robot communication

R Gomez, L Ivanchuk, K Nakamura… - 2015 IEEE/RSJ …, 2015 - ieeexplore.ieee.org
It is easy for human beings to discern whether an observed acoustic signal is a direct
speech, reflected speech or noise through simple listening. Relying purely on acoustic cues …

Optimized wavelet-domain filtering under noisy and reverberant conditions

R Gomez, T Kawahara, K Nakadai - APSIPA Transactions on Signal …, 2015 - cambridge.org
The paper addresses a robust wavelet-based speech enhancement for automatic speech
recognition in reverberant and noisy conditions. We propose a novel scheme in improving …

Speech-based human-robot interaction robust to acoustic reflections in real environment

R Gomez, K Inoue, K Nakamura… - 2014 IEEE/RSJ …, 2014 - ieeexplore.ieee.org
Acoustic reflection inside an enclosed environment is detrimental to human-robot
interaction. Reflection may manifest as phantom sources emanating from unknown …

An improved wavelet-based dereverberation for robust automatic speech recognition

R Gomez, T Kawahara - Eleventh Annual Conference of the …, 2010 - sap.ist.i.kyoto-u.ac.jp
This paper presents an improved wavelet-based dereverberation method for automatic
speech recognition (ASR). Dereverberation is based on filtering reverberant wavelet …