查看文章

ntua.gr 中的 [PDF]

Adaptive multimodal fusion by uncertainty compensation

作者

Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos

发表日期

2006

研讨会论文

Ninth International Conference on Spoken Language Processing

简介

While the accuracy of feature measurements heavily depends on changing environmental conditions, studying the consequences of this fact in pattern recognition tasks has received relatively little attention to date. In this work we explicitly take into account feature measurement uncertainty and we show how classification rules should be adjusted to compensate for its effects. Our approach is particularly fruitful in multimodal fusion scenarios, such as audiovisual speech recognition, where multiple streams of complementary time-evolving features are integrated. For such applications, provided that the measurement noise uncertainty for each feature stream can be estimated, the proposed framework leads to highly adaptive multimodal fusion rules which are widely applicable and easy to implement. We further show that previous multimodal fusion methods relying on stream weights fall under our scheme under certain assumptions; this provides novel insights into their applicability for various tasks and suggests new practical ways for estimating the stream weights adaptively. The potential of our approach is demonstrated in audio-visual speech recognition using either synchronous or asynchronous models.

引用总数

被引用次数：46

20052006200720082009201020112012201320142015201620172018201920202021202220231 1 3 4 1 2 1 5 7 5 3 3 3 2 2 2 1

学术搜索中的文章

Adaptive multimodal fusion by uncertainty compensation.

V Pitsikalis, A Katsamanis, G Papandreou, P Maragos - INTERSPEECH, 2006

被引用次数：46 相关文章所有 17 个版本