A multimodal saliency model for videos with high audio-visual correspondence

X Min, G Zhai, J Zhou, XP Zhang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Audio information has been bypassed by most of current visual attention prediction studies.
However, sound could have influence on visual attention and such influence has been …

[图书][B] Accessible filmmaking: Integrating translation and accessibility into the filmmaking process

P Romero-Fresco - 2019 - taylorfrancis.com
Translation, accessibility and the viewing experience of foreign, deaf and blind audiences
has long been a neglected area of research within film studies. The same applies to the film …

Vinet: Pushing the limits of visual modality for audio-visual saliency prediction

S Jain, P Yarlagadda, S Jyoti, S Karthik… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
We propose the ViNet architecture for audio-visual saliency prediction. ViNet is a fully
convolutional encoder-decoder architecture. The encoder uses visual features from a …

[HTML][HTML] How saliency, faces, and sound influence gaze in dynamic social scenes

A Coutrot, N Guyader - Journal of vision, 2014 - iovs.arvojournals.org
Conversation scenes are a typical example in which classical models of visual attention
dramatically fail to predict eye positions. Indeed, these models rarely consider faces as …

Fixation prediction through multimodal analysis

X Min, G Zhai, K Gu, X Yang - ACM Transactions on Multimedia …, 2016 - dl.acm.org
In this article, we propose to predict human eye fixation through incorporating both audio
and visual cues. Traditional visual attention models generally make the utmost of stimuli's …

Scanpath modeling and classification with hidden Markov models

A Coutrot, JH Hsiao, AB Chan - Behavior research methods, 2018 - Springer
How people look at visual information reveals fundamental information about them; their
interests and their states of mind. Previous studies showed that scanpath, ie, the sequence …

[图书][B] Cognitive media theory

T Nannicelli, P Taberham - 2014 - api.taylorfrancis.com
Across the academy, scholars are debating the question of what bearing scientific inquiry
has upon the humanities. The latest addition to the AFI Film Readers series, Cognitive …

[HTML][HTML] Face exploration dynamics differentiate men and women

A Coutrot, N Binetti, C Harrison, I Mareschal… - Journal of …, 2016 - jov.arvojournals.org
The human face is central to our everyday social interactions. Recent studies have shown
that while gazing at faces, each one of us has a particular eye-scanning pattern, highly …

360-degree video gaze behaviour: A ground-truth data set and a classification algorithm for eye movements

I Agtzidis, M Startsev, M Dorr - Proceedings of the 27th ACM international …, 2019 - dl.acm.org
Eye tracking and the analysis of gaze behaviour are established tools to produce insights
into how humans observe their surroundings and consume visual multimedia content. For …

[HTML][HTML] How soundtracks shape what we see: Analyzing the influence of music on visual scenes through self-assessment, eye tracking, and pupillometry

A Ansani, M Marini, F D'Errico, I Poggi - Frontiers in Psychology, 2020 - frontiersin.org
This article presents two studies that deepen the theme of how soundtracks shape our
interpretation of audiovisuals. Embracing a multivariate perspective, Study 1 (N= 118) …