Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

A survey on open-vocabulary detection and segmentation: Past, present, and future

C Zhu, L Chen - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in the deep learning era. Due to the expensive manual …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - arXiv preprint arXiv …, 2023 - arxiv.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Open3DIS: Open-vocabulary 3D instance segmentation with 2D mask guidance

P Nguyen, TD Ngo, E Kalogerakis… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce Open3DIS, a novel solution designed to tackle the problem of
Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments …

MAMo: Leveraging memory and attention for monocular video depth estimation

R Yasarla, H Cai, J Jeong, Y Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose MAMo, a novel memory and attention framework for monocular video depth
estimation. MAMo can augment and improve any single-image depth estimation networks …

MaskClustering: View consensus based mask graph clustering for open-vocabulary 3D instance segmentation

M Yan, J Zhang, Y Zhu, H Wang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Open-vocabulary 3D instance segmentation is cutting-edge for its ability to segment 3D
instances without predefined categories. However, progress in 3D lags behind its 2D …

LP-OVOD: Open-vocabulary object detection by linear probing

C Pham, T Vu, K Nguyen - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
This paper addresses the challenging problem of open-vocabulary object detection (OVOD)
where an object detector must identify both seen and unseen classes in test images without …

Reference twice: A simple and unified baseline for few-shot instance segmentation

Y Han, J Zhang, Y Wang, C Wang, Y Liu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Few-Shot Instance Segmentation (FSIS) requires detecting and segmenting novel classes
with limited support examples. Existing methods based on Region Proposal Networks …

Entropic open-set active learning

B Safaei, VS Vibashan, CM de Melo… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Active Learning (AL) aims to enhance the performance of deep models by selecting the most
informative samples for annotation from a pool of unlabeled data. Despite impressive …

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

NA Shah, V VS, VM Patel - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Referring Image Segmentation (RIS) aims to segment objects from an image based
on a language description. Recent advancements have introduced transformer-based …