相关文章- 学术资源搜索

Objaverse: A universe of annotated 3d objects

M Deitke, D Schwenk, J Salvador… - Proceedings of the …, 2023 - openaccess.thecvf.com

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and
LAION have propelled recent dramatic progress in AI. Large neural models trained on such …

被引用次数：444 相关文章所有 5 个版本

[PDF] thecvf.com

Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation

T Wu, J Zhang, X Fu, Y Wang, J Ren… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of
large-scale real-scanned 3D databases. To facilitate the development of 3D perception …

被引用次数：106 相关文章所有 5 个版本

[PDF] thecvf.com

Pla: Language-driven open-vocabulary 3d scene understanding

R Ding, J Yang, C Xue, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Open-vocabulary scene understanding aims to localize and recognize unseen categories
beyond the annotated label space. The recent breakthrough of 2D open-vocabulary …

被引用次数：94 相关文章所有 8 个版本

[PDF] arxiv.org

Lowis3d: Language-driven open-world instance-level 3d scene understanding

R Ding, J Yang, C Xue, W Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Open-world instance-level scene understanding aims to locate and recognize unseen object
categories that are not present in the annotated dataset. This task is challenging because …

被引用次数：9 相关文章所有 6 个版本

[PDF] thecvf.com

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

L Xue, M Gao, C Xing, R Martín-Martín… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recognition capabilities of current state-of-the-art 3D models are limited by datasets with
a small number of annotated data and a pre-defined set of categories. In its 2D counterpart …

被引用次数：158 相关文章所有 6 个版本

[PDF] thecvf.com

Learning 3d object categories by looking around them

D Novotny, D Larlus, A Vedaldi - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Traditional approaches for learning 3D object categories use either synthetic data or manual
supervision. In this paper, we propose a method which does not require manual annotations …

被引用次数：86 相关文章所有 11 个版本

[PDF] thecvf.com

Embodiedscan: A holistic multi-modal 3d perception suite towards embodied ai

T Wang, X Mao, C Zhu, R Xu, R Lyu… - Proceedings of the …, 2024 - openaccess.thecvf.com

In the realm of computer vision and robotics embodied agents are expected to explore their
environment and carry out human instructions. This necessitates the ability to fully …

被引用次数：15 相关文章所有 4 个版本

[PDF] thecvf.com

Clip-fo3d: Learning free open-world 3d scene representations from 2d dense clip

J Zhang, R Dong, K Ma - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Training a 3D scene understanding model requires complicated human annotations, which
are laborious to collect and result in a model only encoding close-set object semantics. In …

被引用次数：42 相关文章所有 6 个版本

[PDF] neurips.cc

Objaverse-xl: A universe of 10m+ 3d objects

M Deitke, R Liu, M Wallingford, H Ngo… - Advances in …, 2024 - proceedings.neurips.cc

Natural language processing and 2D vision models have attained remarkable proficiency on
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …

被引用次数：164 相关文章所有 6 个版本

[PDF] neurips.cc

Scalable 3d captioning with pretrained models

T Luo, C Rockwell, H Lee… - Advances in Neural …, 2024 - proceedings.neurips.cc

We introduce Cap3D, an automatic approach for generating descriptive text for 3D objects.
This approach utilizes pretrained models from image captioning, image-text alignment, and …

被引用次数：80 相关文章所有 6 个版本

高级搜索

QQ 群

Objaverse: A universe of annotated 3d objects

Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation

Pla: Language-driven open-vocabulary 3d scene understanding

Lowis3d: Language-driven open-world instance-level 3d scene understanding

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

Learning 3d object categories by looking around them

Embodiedscan: A holistic multi-modal 3d perception suite towards embodied ai

Clip-fo3d: Learning free open-world 3d scene representations from 2d dense clip

Objaverse-xl: A universe of 10m+ 3d objects

Scalable 3d captioning with pretrained models

引用