The PASCAL CHiME speech separation and recognition challenge

J Barker, E Vincent, N Ma, H Christensen… - Computer Speech & …, 2013 - Elsevier
Distant microphone speech recognition systems that operate with human-like robustness
remain a distant goal. The key difficulty is that operating in everyday listening conditions …

The second 'CHiME'speech separation and recognition challenge: Datasets, tasks and baselines

E Vincent, J Barker, S Watanabe… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org
Distant-microphone automatic speech recognition (ASR) remains a challenging goal in
everyday environments involving multiple background sources and reverberation. This …

Learning dynamic stream weights for coupled-HMM-based audio-visual speech recognition

AH Abdelaziz, S Zeiler… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
With the increasing use of multimedia data in communication technologies, the idea of
employing visual information in automatic speech recognition (ASR) has recently gathered …

Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis

S Zeiler, R Nicheli, N Ma, GJ Brown… - … on acoustics, speech …, 2016 - ieeexplore.ieee.org
Automatic speech recognition (ASR) has become a widespread and convenient mode of
human-machine interaction, but it is still not sufficiently reliable when used under highly …

Uncertainty-based learning of acoustic models from noisy data

A Ozerov, M Lagrange, E Vincent - Computer Speech & Language, 2013 - Elsevier
We consider the problem of acoustic modeling of noisy speech data, where the uncertainty
over the data is given by a Gaussian distribution. While this uncertainty has been exploited …

Nonparametric uncertainty estimation and propagation for noise robust ASR

DT Tran, E Vincent, D Jouvet - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
We consider the framework of uncertainty propagation for automatic speech recognition
(ASR) in highly nonstationary noise environments. Uncertainty is considered as the variance …

[PDF][PDF] The TUM+ TUT+ KUL approach to the 2nd CHiME challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF

JT Geiger, F Weninger, A Hurmalainen… - Proc. 2nd CHiME …, 2013 - mediatum.ub.tum.de
We present our joint contribution to the 2nd CHiME Speech Separation and Recognition
Challenge. Our system combines speech enhancement by supervised sparse non-negative …

Recognition of overlapping speech using digital MEMS microphone arrays

E Zwyssig, F Faubel, S Renals… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
This paper presents a new corpus comprising single and overlapping speech recorded
using digital MEMS and analogue microphone arrays. In addition to this, the paper presents …

Integration of beamforming and uncertainty-of-observation techniques for robust ASR in multi-source environments

RF Astudillo, D Kolossa, A Abad, S Zeiler… - Computer Speech & …, 2013 - Elsevier
This paper presents a new approach for increasing the robustness of multi-channel
automatic speech recognition in noisy and reverberant multi-source environments. The …

Binaural scene analysis with multidimensional statistical filters

C Spille, BT Meyer, M Dietz, V Hohmann - The technology of binaural …, 2013 - Springer
The segregation of concurrent speakers and other sound sources is an important aspect in
improving the performance of audio technology, such as noise reduction and automatic …