Audiovisual segmentation

J Zhou, J Wang, J Zhang, W Sun, J Zhang… - … on Computer Vision, 2022 - Springer
… the audio semantics for guiding visual segmentation. We … of audio-visual signals, which
further enhances segmentation … of considering audio signals for visual segmentation, and the …

Audio-visual segmentation with semantics

J Zhou, X Shen, J Wang, J Zhang, W Sun… - International Journal of …, 2024 - Springer
audio semantics for guiding visual segmentation. … audio-visual signals, which further enhances
segmentation performance. At last, we remind readers that the audio-visual segmentation

Avsegformer: Audio-visual segmentation with transformer

S Gao, Z Chen, G Chen, W Wang, T Lu - Proceedings of the AAAI …, 2024 - ojs.aaai.org
… In this paper, we propose AVSegFormer, a novel audiovisual segmentation framework that
leverages the power of transformer architecture to achieve leading performance. Specifically, …

Annotation-free audio-visual segmentation

J Liu, Y Wang, C Ju, C Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
audio-visual segmentation (AVS), by leveraging off-the-shelf large image segmentation
datasets and audio … Besides, we develop an effective method SAMA-AVS to adapt segmentation

Improving audio-visual segmentation with bidirectional generation

D Hao, Y Mao, B He, X Han, Y Dai… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
… In this paper, we introduce audio-visualaudio loop consistency constraints to build strong
correlations between audio and segmented masks and strengthen audio supervision …

Audio content analysis for online audiovisual data segmentation and classification

T Zhang, CCJ Kuo - IEEE Transactions on speech and audio …, 2001 - ieeexplore.ieee.org
… of songs performed by Michael Jackson” may be achieved by searching for audio index of “…
segmenting and classifying accompanying audio signals in audiovisual data based on audio

Weakly-supervised audio-visual segmentation

S Mo, B Raj - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
… multi-scale audiovisual features, we apply … audio-visual segmentation baseline [1] does
not give any cross-modal constraint on the audio and visual representations in the audio-visual

BAVS: bootstrapping audio-visual segmentation by integrating foundation knowledge

C Liu, P Li, H Zhang, L Li, Z Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
… • We introduce a novel two-stage bootstrapping audiovisual segmentation framework (BAVS).
Our framework shows strong robustness against background noise and off-screen sounds. …

Multimodal variational auto-encoder based audio-visual segmentation

Y Mao, J Zhang, M Xiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
segmentation masks. Zhou et al. [1] propose an AVSBench dataset for audiovisual segmentation
and provide a simple baseline based on temporal pixel-wise audio-visual interaction (…

Self-supervised audio-visual co-segmentation

A Rouditchenko, H Zhao, C Gan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
… In this paper we develop a neural network model for visual object segmentation and sound
… independent image segmentation and sound source separation after audio-visual training on …