OpenDAS: Domain Adaptation for Open-Vocabulary Segmentation

G Yilmaz, S Peng, F Engelmann, M Pollefeys… - arXiv preprint arXiv …, 2024 - arxiv.org
The advent of Vision Language Models (VLMs) transformed image understanding from
closed-set classifications to dynamic image-language interactions, enabling open …

Self-guided open-vocabulary semantic segmentation

O Ülger, M Kulicki, Y Asano, MR Oswald - arXiv preprint arXiv:2312.04539, 2023 - arxiv.org
Vision-Language Models (VLMs) have emerged as promising tools for open-ended image
understanding tasks, including open vocabulary segmentation. Yet, direct application of …

Global knowledge calibration for fast open-vocabulary segmentation

K Han, Y Liu, JH Liew, H Ding, J Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advancements in pre-trained vision-language models, such as CLIP, have enabled
the segmentation of arbitrary concepts solely from textual inputs, a process commonly …

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

J Xu, W Chen, Y Zhao, Y Wei - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent success of pre-trained foundation vision-language models makes Open-Vocabulary
Segmentation (OVS) possible. Despite the promising performance this approach introduces …

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

J Luo, S Khandelwal, L Sigal… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
From image-text pairs large-scale vision-language models (VLMs) learn to implicitly
associate image regions with words which prove effective for tasks like visual question …

Plug-and-Play, Dense-Label-Free Extraction of Open-Vocabulary Semantic Segmentation from Vision-Language Models

L Jiayun, S Khandelwal, L Sigal, B Li - arXiv preprint arXiv:2311.17095, 2023 - arxiv.org
From an enormous amount of image-text pairs, large-scale vision-language models (VLMs)
learn to implicitly associate image regions with words, which is vital for tasks such as image …

Open-vocabulary segmentation with semantic-assisted calibration

Y Liu, S Bai, G Li, Y Wang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
This paper studies open-vocabulary segmentation (OVS) through calibrating in-vocabulary
and domain-biased embedding space with generalized contextual prior of CLIP. As the core …

Scaling open-vocabulary image segmentation with image-level labels

G Ghiasi, X Gu, Y Cui, TY Lin - European Conference on Computer Vision, 2022 - Springer
We design an open-vocabulary image segmentation model to organize an image into
meaningful regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite …

A simple baseline for open-vocabulary semantic segmentation with pre-trained vision-language model

M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu… - European Conference on …, 2022 - Springer
Recently, open-vocabulary image classification by vision language pre-training has
demonstrated incredible achievements, that the model can classify arbitrary categories …

TAG: Guidance-free Open-Vocabulary Semantic Segmentation

Y Kawano, Y Aoki - arXiv preprint arXiv:2403.11197, 2024 - arxiv.org
Semantic segmentation is a crucial task in computer vision, where each pixel in an image is
classified into a category. However, traditional methods face significant challenges …