作者
Carlos Busso, Sergi Hernanz, Chi-Wei Chu, Soon-il Kwon, Sung Lee, Panayiotis G Georgiou, Isaac Cohen, Shrikanth Narayanan
发表日期
2005/3/23
研讨会论文
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
卷号
2
页码范围
ii/1117-ii/1120 Vol. 2
出版商
IEEE
简介
Our long-term objective is to create smart room technologies that are aware of the users presence and their behavior and can become an active, but not an intrusive, part of the interaction. In this work, we present a multimodal approach for estimating and tracking the location and identity of the participants including the active speaker. Our smart room design contains three user-monitoring systems: four CCD cameras, an omnidirectional camera and a 16 channel microphone array. The various sensory modalities are processed both individually and jointly and it is shown that the multimodal approach results in significantly improved performance in spatial localization, identification and speech activity detection of the participants.
引用总数
20052006200720082009201020112012201320142015201620172018201920202021202220232024141073894238678557343
学术搜索中的文章
C Busso, S Hernanz, CW Chu, S Kwon, S Lee… - … .(ICASSP'05). IEEE International Conference on …, 2005