CoPa: General robotic manipulation through spatial constraints of parts with foundation models

H Huang, F Lin, Y Hu, S Wang, Y Gao - arXiv preprint arXiv:2403.08248, 2024 - arxiv.org
Foundation models pre-trained on web-scale data are shown to encapsulate extensive
world knowledge beneficial for robotic manipulation in the form of task planning. However …

Pave the way to grasp anything: Transferring foundation models for universal pick-place robots

J Yang, W Tan, C Jin, B Liu, J Fu, R Song… - arXiv preprint arXiv …, 2023 - arxiv.org
Improving the generalization capabilities of general-purpose robotic agents has long been a
significant challenge actively pursued by research communities. Existing approaches often …

Physically grounded vision-language models for robotic manipulation

J Gao, B Sarkar, F Xia, T Xiao, J Wu, B Ichter… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in vision-language models (VLMs) have led to improved performance on
tasks such as visual question answering and image captioning. Consequently, these models …

AlphaBlock: Embodied finetuning for vision-language reasoning in robot manipulation

C Jin, W Tan, J Yang, B Liu, R Song, L Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose a novel framework for learning high-level cognitive capabilities in robot
manipulation tasks, such as making a smiley face using building blocks. These tasks often …

Deep compositional robotic planners that follow natural language commands

YL Kuo, B Katz, A Barbu - 2020 IEEE international conference …, 2020 - ieeexplore.ieee.org
We demonstrate how a sampling-based robotic planner can be augmented to learn to
understand a sequence of natural language commands in a continuous configuration space …

Geometry-based grasping pipeline for bi-modal pick and place

R Haschke, G Walck, H Ritter - 2021 IEEE/RSJ International …, 2021 - ieeexplore.ieee.org
We propose an autonomous grasping pipeline that relies on geometric information extracted
from segmented point cloud data. This is in contrast to many recent approaches leveraging …

Robot task planning and situation handling in open worlds

Y Ding, X Zhang, S Amiri, N Cao, H Yang… - arXiv preprint arXiv …, 2022 - arxiv.org
Automated task planning algorithms have been developed to help robots complete complex
tasks that require multiple actions. Most of those algorithms have been developed for "…

Planning with spatial-temporal abstraction from point clouds for deformable object manipulation

X Lin, C Qi, Y Zhang, Z Huang, K Fragkiadaki… - arXiv preprint arXiv …, 2022 - arxiv.org
Effective planning of long-horizon deformable object manipulation requires suitable
abstractions at both the spatial and temporal levels. Previous methods typically either focus …

A long horizon planning framework for manipulating rigid pointcloud objects

A Simeonov, Y Du, B Kim, F Hogan… - … on Robot Learning, 2021 - proceedings.mlr.press
We present a framework for solving long-horizon planning problems involving manipulation
of rigid objects that operates directly from a point-cloud observation. Our method plans in the …

Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs

Y Zhu, J Tremblay, S Birchfield… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …