The effect of bio-inspired spectro-temporal processing for automatic speech recognition (ASR) is analyzed for two different tasks with focus on the robustness of spectro-temporal …
In this article we present a new framework for auditory processing that combines feature extraction and grouping processes to form what we call audio proto objects. These proto …
M Heckmann - … Workshop on Multimodal Analyses Enabling Artificial …, 2014 - Springer
We investigate how word prominence can be detected from the acoustic signal and movements of the speaker's head and mouth. Our research is based on a corpus with 12 …
M Heckmann, H Brandl… - RO-MAN 2009-The …, 2009 - ieeexplore.ieee.org
Based on inspirations from infant development we present a system which learns associations between acoustic labels and visual representations in interaction with its tutor …
Despite several decades of research, automatic speech recognition (ASR) lacks the performance achieved by human listeners. One of the major challenges in ASR is to cope …
T Rodemann, F Joublin… - … Conference on Advanced …, 2009 - ieeexplore.ieee.org
In this article we present an approach for separating robot-directed speech from environmental sounds for applications in robot audition under high noise conditions. We …
M Heckmann - Statistical And Perceptual Audition 2010, 2010 - isca-archive.org
To overcome limitations of purely spectral speech features we previously introduced Hierarchical Spectro-Temporal (HIST) features. We could show that a combination of HIST …
Automatic speech recognition (ASR) systems still do not perform as well as human listeners under realistic conditions. The unmatched ability of humans to understand speech in most …
M Heckmann, H Brandl, X Domont, B Bolder… - … Annual Conference of …, 2009 - honda-ri.de
We present an audio-visual attention system for speech based interaction with a humanoid robot where a tutor can teach visual properties/locations (e.g. “left”) and corresponding …