Openmask3d: Open-vocabulary 3d instance segmentation

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

Cited by 55 Related articles All 10 versions

[PDF] thecvf.com

Multi3drefer: Grounding text description to multiple 3d objects

Y Zhang, ZM Gong, AX Chang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We introduce the task of localizing a flexible number of objects in real-world 3D scenes
using natural language descriptions. Existing 3D visual grounding tasks focus on localizing …

Cited by 21 Related articles All 7 versions

[PDF] thecvf.com

Clip-fo3d: Learning free open-world 3d scene representations from 2d dense clip

J Zhang, R Dong, K Ma - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Training a 3D scene understanding model requires complicated human annotations, which
are laborious to collect and result in a model only encoding close-set object semantics. In …

Cited by 38 Related articles All 6 versions

[PDF] thecvf.com

Embodiedscan: A holistic multi-modal 3d perception suite towards embodied ai

T Wang, X Mao, C Zhu, R Xu, R Lyu… - Proceedings of the …, 2024 - openaccess.thecvf.com

In the realm of computer vision and robotics embodied agents are expected to explore their
environment and carry out human instructions. This necessitates the ability to fully …

Cited by 13 Related articles All 4 versions

[PDF] thecvf.com

Open3dis: Open-vocabulary 3d instance segmentation with 2d mask guidance

P Nguyen, TD Ngo, E Kalogerakis… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce Open3DIS a novel solution designed to tackle the problem of Open-
Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments …

Cited by 12 Related articles All 3 versions

[PDF] thecvf.com

Regionplc: Regional point-language contrastive learning for open-world 3d scene understanding

J Yang, R Ding, W Deng, Z Wang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

We propose a lightweight and scalable Regional Point-Language Contrastive learning
framework namely RegionPLC for open-world 3D scene understanding aiming to identify …

Cited by 28 Related articles All 3 versions

[PDF] arxiv.org

Conceptgraphs: Open-vocabulary 3d scene graphs for perception and planning

Q Gu, A Kuwajerwala, S Morin… - arXiv preprint arXiv …, 2023 - arxiv.org

For robots to perform a wide variety of tasks, they require a 3D representation of the world
that is semantically rich, yet compact and efficient for task-driven perception and planning …

Cited by 50 Related articles All 6 versions

[PDF] thecvf.com

Language embedded 3d gaussians for open-vocabulary scene understanding

JC Shi, M Wang, HB Duan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Open-vocabulary querying in 3D space is challenging but essential for scene understanding
tasks such as object localization and segmentation. Language-embedded scene …

Cited by 13 Related articles All 3 versions

[PDF] thecvf.com

Scenefun3d: Fine-grained functionality and affordance understanding in 3d scenes

A Delitzas, A Takmaz, F Tombari… - Proceedings of the …, 2024 - openaccess.thecvf.com

Existing 3D scene understanding methods are heavily focused on 3D semantic and instance
segmentation. However identifying objects and their parts only constitutes an intermediate …

Cited by 7 Related articles All 3 versions

[PDF] arxiv.org

Gaussian grouping: Segment and edit anything in 3d scenes

M Ye, M Danelljan, F Yu, L Ke - arXiv preprint arXiv:2312.00732, 2023 - arxiv.org

The recent Gaussian Splatting achieves high-quality and real-time novel-view synthesis of
the 3D scenes. However, it is solely concentrated on the appearance and geometry …

Cited by 32 Related articles All 2 versions
