Self-supervised object-centric learning for videos

G Aydemir, W Xie, F Guney - Advances in Neural …, 2023 - proceedings.neurips.cc
Unsupervised multi-object segmentation has shown impressive results on images by
utilizing powerful semantics learned from self-supervised pretraining. An additional modality …

Object-centric learning for real-world videos by predicting temporal feature similarities

A Zadaianchuk, M Seitzer… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised video-based object-centric learning is a promising avenue to learn structured
representations from large, unlabeled video collections, but previous approaches have only …

Unsupervised object localization in the era of self-supervised vits: A survey

O Siméoni, É Zablocki, S Gidaris, G Puy… - International Journal of …, 2024 - Springer
The recent enthusiasm for open-world vision systems show the high interest of the
community to perform perception tasks outside of the closed-vocabulary benchmark setups …

SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

I Kakogeorgiou, S Gidaris… - Proceedings of the …, 2024 - openaccess.thecvf.com
Unsupervised object-centric learning aims to decompose scenes into interpretable object
entities termed slots. Slot-based auto-encoders stand out as a prominent method for this …

CarFormer: Self-Driving with Learned Object-Centric Representations

S Hamdan, F Güney - European Conference on Computer Vision, 2025 - Springer
The choice of representation plays a key role in self-driving. Bird's eye view (BEV)
representations have shown remarkable performance in recent years. In this paper, we …

DIOD: Self-Distillation Meets Object Discovery

S Kara, H Ammar, J Denize… - Proceedings of the …, 2024 - openaccess.thecvf.com
Instance segmentation demands substantial labeling resources. This has prompted
increased interest to explore the object discovery task as an unsupervised alternative. In …

The Background Also Matters: Background-Aware Motion-Guided Objects Discovery

S Kara, H Ammar, F Chabot… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent works have shown that objects discovery can largely benefit from the inherent
motion information in video data. However, these methods lack a proper background …

On Moving Object Segmentation from Monocular Video with Transformers

C Homeyer, C Schnörr - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Moving object detection and segmentation from a single moving camera is a challenging
task, requiring an understanding of recognition, motion and 3D geometry. Combining both …

Spotlight attention: Robust object-centric learning with a spatial locality prior

A Chakravarthy, T Nguyen, A Goyal, Y Bengio… - arXiv preprint arXiv …, 2023 - arxiv.org
The aim of object-centric vision is to construct an explicit representation of the objects in a
scene. This representation is obtained via a set of interchangeable modules called\emph …

Divided attention: Unsupervised multi-object discovery with contextually separated slots

D Lao, Z Hu, F Locatello, Y Yang, S Soatto - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce a method to segment the visual field into independently moving regions,
trained with no ground truth or supervision. It consists of an adversarial conditional encoder …