Mask-free ovis: Open-vocabulary instance segmentation without manual mask annotations

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

被引用次数：64 相关文章所有 10 个版本

[PDF] arxiv.org

A survey on open-vocabulary detection and segmentation: Past, present, and future

C Zhu, L Chen - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org

As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in deep learning era. Due to the expensive manual …

被引用次数：10 相关文章所有 7 个版本

[PDF] arxiv.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - arXiv preprint arXiv …, 2023 - arxiv.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

被引用次数：59 相关文章所有 3 个版本

[PDF] thecvf.com

Open3dis: Open-vocabulary 3d instance segmentation with 2d mask guidance

P Nguyen, TD Ngo, E Kalogerakis… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce Open3DIS a novel solution designed to tackle the problem of Open-
Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments …

被引用次数：14 相关文章所有 3 个版本

[PDF] thecvf.com

Mamo: Leveraging memory and attention for monocular video depth estimation

R Yasarla, H Cai, J Jeong, Y Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com

We propose MAMo, a novel memory and attention framework for monocular video depth
estimation. MAMo can augment and improve any single-image depth estimation networks …

被引用次数：5 相关文章所有 6 个版本

[PDF] thecvf.com

Maskclustering: View consensus based mask graph clustering for open-vocabulary 3d instance segmentation

M Yan, J Zhang, Y Zhu, H Wang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Open-vocabulary 3D instance segmentation is cutting-edge for its ability to segment 3D
instances without predefined categories. However progress in 3D lags behind its 2D …

被引用次数：4 相关文章所有 3 个版本

[PDF] thecvf.com

Lp-ovod: Open-vocabulary object detection by linear probing

C Pham, T Vu, K Nguyen - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

This paper addresses the challenging problem of open-vocabulary object detection (OVOD)
where an object detector must identify both seen and unseen classes in test images without …

被引用次数：6 相关文章所有 5 个版本

[PDF] arxiv.org

Reference twice: A simple and unified baseline for few-shot instance segmentation

Y Han, J Zhang, Y Wang, C Wang, Y Liu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Few-Shot Instance Segmentation (FSIS) requires detecting and segmenting novel classes
with limited support examples. Existing methods based on Region Proposal Networks …

被引用次数：9 相关文章所有 2 个版本

[PDF] aaai.org

Entropic open-set active learning

B Safaei, VS Vibashan, CM de Melo… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Active Learning (AL) aims to enhance the performance of deep models by selecting the most
informative samples for annotation from a pool of unlabeled data. Despite impressive …

被引用次数：2 相关文章所有 4 个版本

[PDF] thecvf.com

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

NA Shah, V VS, VM Patel - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Abstract Referring Image Segmentation (RIS) aims to segment objects from an image based
on a language description. Recent advancements have introduced transformer-based …

被引用次数：1 相关文章

高级搜索

QQ 群