Integrating source-channel and attention-based sequence-to-sequence models for speech recognition

Q Li, C Zhang, PC Woodland - … Automatic Speech Recognition …, 2019 - ieeexplore.ieee.org
… novel automatic speech recognition (ASR) framework called Integrated Source-Channel and
… of traditional systems based on the noisy source-channel model (SC) and end-to-end style …

Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition

D Bagchi, MI Mandel, Z Wang, Y He… - … Speech Recognition …, 2015 - ieeexplore.ieee.org
… applying a model-based source separation mask to the output of a beamformer that combines
the source signals recorded … The source separation algorithm MESSL (Model-based EM …

Universal speech models for speaker independent single channel source separation

DL Sun, GJ Mysore - … Acoustics, Speech and Signal Processing, 2013 - ieeexplore.ieee.org
… a universal speech model from a general corpus of speech and … model to separate speech
from other sound sources. This model is used in lieu of a speech model trained on speaker-…

Improved source modeling and predictive classification for channel robust speech recognition.

V Ion, R Haeb-Umbach - INTERSPEECH, 2006 - masters.donntu.ru
… In this paper the Bayesian framework of speech recognition was reformulated for the
server side of a distributed system. This resulted in a predictive decision rule which was …

Single-channel multitalker speech recognition

SJ Rennie, JR Hershey… - IEEE Signal Processing …, 2010 - ieeexplore.ieee.org
… -based methods, use explicit models of the speech sources to separate them [7]–[10]. The
model for each source in a given frame (a small window of speech of approximately 40 ms) is …

MIMO-Speech: End-to-end multi-channel multi-speaker speech recognition

X Chang, W Zhang, Y Qian, J Le Roux… - … Speech Recognition …, 2019 - ieeexplore.ieee.org
… -separation by predicting multiple speaker and noise masks for each channel. Then a
multi-source neural beamformer is used to spatially separate multiple speaker sources. In the last …

Automatic speech recognition system channel modeling.

QF Tan, K Audhkhasi, PG Georgiou, E Ettelaie… - Interspeech, 2010 - isca-archive.org
… for channel modeling of an Automatic Speech Recognition (ASR) system. This can have
implications in improving speech recognition … of the source language in the translation model. …

A source model mitigation technique for distributed speech recognition over lossy packet channels.

ÁM Gómez, AM Peinado, VE Sánchez… - INTERSPEECH, 2003 - researchgate.net
… technique for a distributed speech recognition system over IP. … of the information contained
in the data-source, because, in IP … previous and next received speech vector sequences, we …

Multi-channel speech recognition: LSTMs all the way through

H Erdogan, T Hayashi, JR Hershey, T Hori… - CHiME-4 …, 2016 - groups.csail.mit.edu
… threat” system for speech recognition, where LSTMs drive … array processing, acoustic
modeling, and language modeling. This … source was significantly larger than the other source. For …

Speech recognition in unseen and noisy channel conditions

V Mitra, H Franco, C Bartels, J van Hout… - … Signal Processing …, 2017 - ieeexplore.ieee.org
… (AM) training, we used approximately 250 hours of retransmitted conversational speech (LDC2011E111
and LDC2011E93). For language model (LM) training, we used various sources