Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition

MR Schädler, BT Meyer, B Kollmeier - The Journal of the Acoustical …, 2012 - pubs.aip.org
In an attempt to increase the robustness of automatic speech recognition (ASR) systems, a
feature extraction scheme is proposed that takes spectro-temporal modulation frequencies …

Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition

BT Meyer, B Kollmeier - Speech Communication, 2011 - Elsevier
The effect of bio-inspired spectro-temporal processing for automatic speech recognition
(ASR) is analyzed for two different tasks with focus on the robustness of spectro-temporal …

Audio proto objects for improved sound localization

T Rodemann, F Joublin… - 2009 IEEE/RSJ …, 2009 - ieeexplore.ieee.org
In this article we present a new framework for auditory processing that combines feature
extraction and grouping processes to form what we call audio proto objects. These proto …

Steps towards more natural human-machine interaction via audio-visual word prominence detection

M Heckmann - … Workshop on Multimodal Analyses Enabling Artificial …, 2014 - Springer
We investigate how word prominence can be detected from the acoustic signal and
movements of the speaker's head and mouth. Our research is based on a corpus with 12 …

Teaching a humanoid robot: Headset-free speech interaction for audio-visual association learning

M Heckmann, H Brandl… - RO-MAN 2009-The …, 2009 - ieeexplore.ieee.org
Based on inspirations from infant development we present a system which learns
associations between acoustic labels and visual representations in interaction with its tutor …

Human and automatic speech recognition in the presence of speech-intrinsic variations

BT Meyer - 2009 - oops.uni-oldenburg.de
Despite several decades of research, automatic speech recognition (ASR) lacks the
performance achieved by human listeners. One of the major challenges in ASR is to cope …

Filtering environmental sounds using basic audio cues in robot audition

T Rodemann, F Joublin… - … Conference on Advanced …, 2009 - ieeexplore.ieee.org
In this article we present an approach for separating robot-directed speech from
environmental sounds for applications in robot audition under high noise conditions. We …

[PDF][PDF] Supervised vs. unsupervised learning of spectro temporal speech features

M Heckmann - Statistical And Perceptual Audition 2010, 2010 - isca-archive.org
To overcome limitations of purely spectral speech features we previously introduced
Hierarchical Spectro-Temporal (HIST) features. We could show that a combination of HIST …

[图书][B] Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features

MR Schädler - 2016 - oops.uni-oldenburg.de
Automatic speech recognition (ASR) systems still do not perform as well as human listeners
under realistic conditions. The unmatched ability of humans to understand speech in most …

[PDF][PDF] An audio-visual attention system for online association learning

M Heckmann, H Brandl, X Domont, B Bolder… - … Annual Conference of …, 2009 - honda-ri.de
We present an audio-visual attention system for speech based interaction with a humanoid
robot where a tutor can teach visual properties/locations (eg” left”) and corresponding …