J Xiong, G Wang, P Zhang, W Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Incorporating the audio stream enables Video Saliency Prediction (VSP) to imitate the
selective attention mechanism of the human brain. By focusing on the benefits of joint auditory …