C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency detection (VSD) aims at fast locating the most attractive objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …
We propose the ViNet architecture for audio-visual saliency prediction. ViNet is a fully convolutional encoder-decoder architecture. The encoder uses visual features from a …
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-temporal visual and auditory information in order to efficiently address the problem of …
Z Yang, S Soltanian-Zadeh, S Farsiu - Pattern recognition, 2022 - Elsevier
Salient object detection (SOD) is viewed as a pixel-wise saliency modeling task by traditional deep learning-based methods. A limitation of current SOD models is insufficient …
In recent years, several authors have reported that spectral saliency detection methods provide state-of-the-art performance in predicting human gaze in images (see, e.g., [1–3]). We …
Egocentric gaze anticipation serves as a key building block for the emerging capability of Augmented Reality. Notably, gaze behavior is driven by both visual cues and audio signals …
ML Needham, KL Baum, F Ishtiaq, R Li… - US Patent …, 2017 - Google Patents
A method implemented in a computer system for controlling the delivery of data and audio/video content. The method delivers primary content to the subscriber device for …
D Zhu, X Shao, Q Zhou, X Min, G Zhai… - ACM Transactions on …, 2023 - dl.acm.org
Audio information has not been considered an important factor in visual attention models regardless of many psychological studies that have shown the importance of audio …
The Transformer revolutionized Natural Language Processing and Computer Vision by effectively capturing contextual relationships in sequential data through its attention …