We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research in scalable robot learning. BridgeData V2 contains 53,896 …
Massive datasets and high-capacity models have driven many recent advancements in computer vision and natural language understanding. This work presents a platform to …
We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an …
3D vision-language (3D-VL) grounding, which aims to align language with 3D physical environments, stands as a cornerstone in developing embodied agents. In …
Y Xu, W Wan, J Zhang, H Liu, Z Shan… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we tackle the problem of learning universal robotic dexterous grasping from a point cloud observation in a table-top setting. The goal is to grasp and lift objects in …
Y Liu, Y Liu, C Jiang, K Lyu, W Wan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze research on category-level human-object interaction. HOI4D consists of 2.4M RGB-D …
We propose a novel, object-agnostic method for learning a universal policy for dexterous object grasping from realistic point cloud observations and proprioceptive information under …
R Yang, G Yang, X Wang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Legged robots have the potential to expand the reach of autonomy beyond paved roads. In this work, we consider the difficult problem of locomotion on challenging terrains using a …
For years, researchers have pursued generalizable object perception and manipulation, where cross-category generalizability is highly desired yet underexplored. In …