Y Wang,
P Sun,
Y Li, H Zhang, D Hu - arXiv preprint arXiv:2407.10947, 2024 - arxiv.org
The Audio-Visual Segmentation (AVS) task aims to segment sounding objects in the visual
space using audio cues. However, in this work, it is recognized that previous AVS methods …