查看文章

eurasip.org 中的 [PDF]

An Efficient Audiovisual Saliency Model to Predict Eye Positions When Looking at Conversations

作者

Antoine Coutrot, Nathalie Guyader

发表日期

2015

研讨会论文

European Conference on Signal Processing (EUSIPCO)

简介

Classic models of visual attention dramatically fail at predicting eye positions on visual scenes involving faces. While some recent models combine faces with low-level features, none of them consider sound as an input. Yet it is crucial in conversation or meeting scenes. In this paper, we describe and refine an audiovisual saliency model for conversation scenes. This model includes a speaker diarization algorithm which automatically modulates the saliency of conversation partners' faces and bodies according to their speaking-or-not status. To merge our different features into a master saliency map, we use an efficient statistical method (Lasso) allowing a straightforward interpretation of feature relevance. To train and evaluate our model, we run an eye tracking experiment on a publicly available meeting videobase. We show that increasing the saliency of speakers' faces (but not bodies) greatly improves the …

引用总数

被引用次数：30

2016201720182019202020212022202320241 3 2 2 6 5 5 4 2

学术搜索中的文章

An efficient audiovisual saliency model to predict eye positions when looking at conversations

A Coutrot, N Guyader - 2015 23rd European signal processing conference …, 2015

被引用次数：30 相关文章所有 8 个版本