iGibson 2.0: Object-centric simulation for robot learning of everyday household tasks

W Liang, GA Tadesse, D Ho, L Fei-Fei… - Nature Machine …, 2022 - nature.com

As artificial intelligence (AI) transitions from research to deployment, creating the appropriate
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …

Cited by 254 - Related articles - All 3 versions

[PDF] arxiv.org

A survey on active simultaneous localization and mapping: State of the art and new frontiers

JA Placed, J Strader, H Carrillo… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

Active simultaneous localization and mapping (SLAM) is the problem of planning and
controlling the motion of a robot to build the most accurate and complete model of the …

Cited by 147 - Related articles - All 14 versions

[PDF] thecvf.com

Objaverse: A universe of annotated 3D objects

M Deitke, D Schwenk, J Salvador… - Proceedings of the …, 2023 - openaccess.thecvf.com

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and
LAION have propelled recent dramatic progress in AI. Large neural models trained on such …

Cited by 466 - Related articles - All 5 versions

[PDF] arxiv.org

TidyBot: Personalized robot assistance with large language models

J Wu, R Antonova, A Kan, M Lepert, A Zeng, S Song… - Autonomous …, 2023 - Springer

For a robot to personalize physical assistance effectively, it must learn user preferences that
can be generally reapplied to future scenarios. In this work, we investigate personalization of …

Cited by 222 - Related articles - All 13 versions

[PDF] thecvf.com

UniSim: A neural closed-loop sensor simulator

Z Yang, Y Chen, J Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Rigorously testing autonomy systems is essential for making safe self-driving vehicles (SDV)
a reality. It requires one to generate safety critical scenarios beyond what can be collected …

Cited by 111 - Related articles - All 8 versions

[PDF] thecvf.com

Diffusion-based generation, optimization, and planning in 3D scenes

S Huang, Z Wang, P Li, B Jia, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce SceneDiffuser, a conditional generative model for 3D scene understanding.
SceneDiffuser provides a unified model for solving scene-conditioned generation …

Cited by 137 - Related articles - All 9 versions

[PDF] neurips.cc

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

M Deitke, E VanderBilt, A Herrasti… - Advances in …, 2022 - proceedings.neurips.cc

Massive datasets and high-capacity models have driven many recent advancements in
computer vision and natural language understanding. This work presents a platform to …

Cited by 150 - Related articles - All 5 versions

[PDF] caltech.edu

[PDF] VIMA: General robot manipulation with multimodal prompts

Y Jiang, A Gupta, Z Zhang, G Wang… - arXiv preprint …, 2022 - authors.library.caltech.edu

Prompt-based learning has emerged as a successful paradigm in natural language
processing, where a single general-purpose language model can be instructed to perform …

Cited by 153 - Related articles - All 6 versions

[PDF] mlr.press

BEHAVIOR-1K: A benchmark for embodied AI with 1,000 everyday activities and realistic simulation

C Li, R Zhang, J Wong, C Gokmen… - … on Robot Learning, 2023 - proceedings.mlr.press

We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered
robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an …

Cited by 117 - Related articles - All 3 versions

[PDF] thecvf.com

Simple but effective: CLIP embeddings for embodied AI

A Khandelwal, L Weihs, R Mottaghi… - Proceedings of the …, 2022 - openaccess.thecvf.com

Contrastive language image pretraining (CLIP) encoders have been shown to be beneficial
for a range of visual tasks from classification and detection to captioning and image …

Cited by 196 - Related articles - All 5 versions
