Recent advances in the automatic recognition of audiovisual speech

G Potamianos, C Neti, G Gravier, A Garg… - Proceedings of the …, 2003 - ieeexplore.ieee.org
Visual speech information from the speaker's mouth region has been successfully shown to
improve noise robustness of automatic speech recognizers, thus promising to extend their …

A review of recent advances in visual speech decoding

Z Zhou, G Zhao, X Hong, M Pietikäinen - Image and vision computing, 2014 - Elsevier
Visual speech information plays an important role in automatic speech recognition (ASR)
especially when audio is corrupted or even inaccessible. Despite the success of audio …

Automatic classification of heartbeats using ECG morphology and heartbeat interval features

P De Chazal, M O'Dwyer… - IEEE transactions on …, 2004 - ieeexplore.ieee.org
A method for the automatic processing of the electrocardiogram (ECG) for the classification
of heartbeats is presented. The method allocates manually detected heartbeats to one of the …

A patient-adapting heartbeat classifier using ECG morphology and heartbeat interval features

P De Chazal, RB Reilly - IEEE transactions on biomedical …, 2006 - ieeexplore.ieee.org
An adaptive system for the automatic processing of the electrocardiogram (ECG) for the
classification of heartbeats into one of the five beat classes recommended by ANSI/AAMI …

Audio-visual deep learning for noise robust speech recognition

J Huang, B Kingsbury - 2013 IEEE international conference on …, 2013 - ieeexplore.ieee.org
Deep belief networks (DBN) have shown impressive improvements over Gaussian mixture
models for automatic speech recognition. In this work we use DBNs for audio-visual speech …

Audiovisual fusion: Challenges and new approaches

AK Katsaggelos, S Bahaadini… - Proceedings of the …, 2015 - ieeexplore.ieee.org
In this paper, we review recent results on audiovisual (AV) fusion. We also discuss some of
the challenges and report on approaches to address them. One important issue in AV fusion …

Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition

G Papandreou, A Katsamanis… - … on Audio, Speech …, 2009 - ieeexplore.ieee.org
While the accuracy of feature measurements heavily depends on changing environmental
conditions, studying the consequences of this fact in pattern recognition tasks has received …

Robust audio-visual speech recognition under noisy audio-video conditions

D Stewart, R Seymour, A Pass… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
This paper presents the maximum weighted stream posterior (MWSP) model as a robust and
efficient stream integration method for audio-visual speech recognition in environments …

New entropy based combination rules in HMM/ANN multi-stream ASR

H Misra, H Bourlard, V Tyagi - 2003 IEEE International …, 2003 - ieeexplore.ieee.org
Classifier performance is often enhanced through combining multiple streams of information.
In the context of multi-stream HMM/ANN systems in ASR, a confidence measure widely used …

Learning dynamic stream weights for coupled-HMM-based audio-visual speech recognition

AH Abdelaziz, S Zeiler… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
With the increasing use of multimedia data in communication technologies, the idea of
employing visual information in automatic speech recognition (ASR) has recently gathered …