Remote sensing object detection in the deep learning era—a review

S Gui, S Song, R Qin, Y Tang - Remote Sensing, 2024 - mdpi.com
Given the large volume of remote sensing images collected daily, automatic object detection
and segmentation have been a consistent need in Earth observation (EO). However, objects …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Vision transformers need registers

T Darcet, M Oquab, J Mairal, P Bojanowski - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have recently emerged as a powerful tool for learning visual representations.
In this paper, we identify and characterize artifacts in feature maps of both supervised and …

Self-supervised object-centric learning for videos

G Aydemir, W Xie, F Guney - Advances in Neural …, 2023 - proceedings.neurips.cc
Unsupervised multi-object segmentation has shown impressive results on images by
utilizing powerful semantics learned from self-supervised pretraining. An additional modality …

Zero-shot referring image segmentation with global-local context features

S Yu, PH Seo, J Son - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Referring image segmentation (RIS) aims to find a segmentation mask given a referring
expression grounded to a region of the input image. Collecting labelled datasets for this …

Proxyclip: Proxy attention improves clip for open-vocabulary segmentation

M Lan, C Chen, Y Ke, X Wang, L Feng… - European Conference on …, 2025 - Springer
Open-vocabulary semantic segmentation requires models to effectively integrate visual
representations with open-vocabulary semantic labels. While Contrastive Language-Image …

Diffusion models for zero-shot open-vocabulary segmentation

L Karazija, I Laina, A Vedaldi, C Rupprecht - arXiv preprint arXiv …, 2023 - arxiv.org
The variety of objects in the real world is nearly unlimited and is thus impossible to capture
using models trained on a fixed set of categories. As a result, in recent years, open …

Unsupervised universal image segmentation

D Niu, X Wang, X Han, L Lian… - Proceedings of the …, 2024 - openaccess.thecvf.com
Several unsupervised image segmentation approaches have been proposed which
eliminate the need for dense manually-annotated segmentation masks; current models …

Time does tell: Self-supervised time-tuning of dense image representations

M Salehi, E Gavves, CGM Snoek… - Proceedings of the …, 2023 - openaccess.thecvf.com
Spatially dense self-supervised learning is a rapidly growing problem domain with
promising applications for unsupervised segmentation and pretraining for dense …

Object-centric slot diffusion

J Jiang, F Deng, G Singh, S Ahn - arXiv preprint arXiv:2303.10834, 2023 - arxiv.org
The recent success of transformer-based image generative models in object-centric learning
highlights the importance of powerful image generators for handling complex scenes …