- 学术资源搜索

Vision-language models for vision tasks: A survey

J Zhang, J Huang, S Jin, S Lu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org

Most visual recognition studies rely heavily on crowd-labelled data in deep neural networks
(DNNs) training, and they usually train a DNN for each single visual recognition task …

被引用次数：155 相关文章所有 9 个版本

[PDF] thecvf.com

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

L Xue, M Gao, C Xing, R Martín-Martín… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recognition capabilities of current state-of-the-art 3D models are limited by datasets with
a small number of annotated data and a pre-defined set of categories. In its 2D counterpart …

被引用次数：157 相关文章所有 6 个版本

[PDF] thecvf.com

A simple framework for open-vocabulary segmentation and detection

H Zhang, F Li, X Zou, S Liu, C Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this work, we present OpenSeeD, a simple Open-vocabulary Segmentation and Detection
framework that learns from different segmentation and detection datasets. To bridge the gap …

被引用次数：100 相关文章所有 5 个版本

[PDF] thecvf.com

Aligning bag of regions for open-vocabulary object detection

S Wu, W Zhang, S Jin, W Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Pre-trained vision-language models (VLMs) learn to align vision and language
representations on large-scale datasets, where each image-text pair usually contains a bag …

被引用次数：62 相关文章所有 5 个版本

[PDF] thecvf.com

Cora: Adapting clip for open-vocabulary detection with region prompting and anchor pre-matching

X Wu, F Zhu, R Zhao, H Li - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com

Open-vocabulary detection (OVD) is an object detection task aiming at detecting objects
from novel categories beyond the base categories on which the detector is trained. Recent …

被引用次数：59 相关文章所有 5 个版本

[PDF] neurips.cc

Codet: Co-occurrence guided region-word alignment for open-vocabulary object detection

C Ma, Y Jiang, X Wen, Z Yuan… - Advances in neural …, 2024 - proceedings.neurips.cc

Deriving reliable region-word alignment from image-text pairs is critical to learnobject-level
vision-language representations for open-vocabulary object detection. Existing methods …

被引用次数：23 相关文章所有 6 个版本

[PDF] ieee.org

Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

被引用次数：64 相关文章所有 10 个版本

[PDF] thecvf.com

V3det: Vast vocabulary visual detection dataset

J Wang, P Zhang, T Chu, Y Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent advances in detecting arbitrary objects in the real world are trained and evaluated
on object detection datasets with a relatively restricted vocabulary. To facilitate the …

被引用次数：36 相关文章所有 5 个版本

[PDF] arxiv.org

A survey on open-vocabulary detection and segmentation: Past, present, and future

C Zhu, L Chen - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org

As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in deep learning era. Due to the expensive manual …

被引用次数：10 相关文章所有 7 个版本

[PDF] thecvf.com

Learning background prompts to discover implicit knowledge for open vocabulary object detection

J Li, J Zhang, J Li, G Li, S Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Open vocabulary object detection (OVD) aims at seeking an optimal object detector capable
of recognizing objects from both base and novel categories. Recent advances leverage …

被引用次数：4 相关文章所有 3 个版本

高级搜索

QQ 群

Vision-language models for vision tasks: A survey

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

A simple framework for open-vocabulary segmentation and detection

Aligning bag of regions for open-vocabulary object detection

Cora: Adapting clip for open-vocabulary detection with region prompting and anchor pre-matching

Codet: Co-occurrence guided region-word alignment for open-vocabulary object detection

Towards open vocabulary learning: A survey

V3det: Vast vocabulary visual detection dataset

A survey on open-vocabulary detection and segmentation: Past, present, and future

Learning background prompts to discover implicit knowledge for open vocabulary object detection

引用