Advances in deep concealed scene understanding

DP Fan, GP Ji, P Xu, MM Cheng, C Sakaridis… - Visual Intelligence, 2023 - Springer
Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive
objects exhibiting camouflage. The current boom in terms of techniques and applications …

Reco: Retrieve and co-segment for zero-shot transfer

G Shin, W Xie, S Albanie - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Semantic segmentation has a broad range of applications, but its real-world impact has
been significantly limited by the prohibitive annotation costs necessary to enable …

Tokencut: Segmenting objects in images and videos with self-supervised transformer and normalized cut

Y Wang, X Shen, Y Yuan, Y Du, M Li… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we describe a graph-based algorithm that uses the features obtained by a self-
supervised transformer to detect and segment salient objects in images and videos. With this …

Diffusion models for zero-shot open-vocabulary segmentation

L Karazija, I Laina, A Vedaldi, C Rupprecht - arXiv preprint arXiv …, 2023 - arxiv.org
The variety of objects in the real world is nearly unlimited and is thus impossible to capture
using models trained on a fixed set of categories. As a result, in recent years, open …

Unsupervised object localization: Observing the background to discover objects

O Siméoni, C Sekkat, G Puy… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advances in self-supervised visual representation learning have paved the way for
unsupervised methods tackling tasks such as object discovery and instance segmentation …

Open-vocabulary semantic segmentation with frozen vision-language models

C Ma, Y Yang, Y Wang, Y Zhang, W Xie - arXiv preprint arXiv:2210.15138, 2022 - arxiv.org
When trained at a sufficient scale, self-supervised learning has exhibited a notable ability to
solve a wide range of visual or language understanding tasks. In this paper, we investigate …

Texture-guided saliency distilling for unsupervised salient object detection

H Zhou, B Qiao, L Yang, J Lai… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Deep Learning-based Unsupervised Salient Object Detection (USOD) mainly relies
on the noisy saliency pseudo labels that have been generated from traditional handcraft …

Paintseg: Painting pixels for training-free segmentation

X Li, CC Lin, Y Chen, Z Liu, J Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
The paper introduces PaintSeg, a new unsupervised method for segmenting objects without
any training. We propose an adversarial masked contrastive painting (AMCP) process …

In defense of lazy visual grounding for open-vocabulary semantic segmentation

D Kang, M Cho - European Conference on Computer Vision, 2025 - Springer
Abstract We present Lazy Visual Grounding for open-vocabulary semantic segmentation,
which decouples unsupervised object mask discovery from object grounding. Plenty of the …

Guess what moves: Unsupervised video and image segmentation by anticipating motion

S Choudhury, L Karazija, I Laina, A Vedaldi… - arXiv preprint arXiv …, 2022 - arxiv.org
Motion, measured via optical flow, provides a powerful cue to discover and learn objects in
images and videos. However, compared to using appearance, it has some blind spots, such …