Skill-based model-based reinforcement learning

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

被引用次数：38 相关文章所有 6 个版本

[PDF] mlr.press

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press

Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

被引用次数：73 相关文章所有 6 个版本

[PDF] arxiv.org

Bootstrap your own skills: Learning to solve new tasks with large language model guidance

J Zhang, J Zhang, K Pertsch, Z Liu, X Ren… - arXiv preprint arXiv …, 2023 - arxiv.org

We propose BOSS, an approach that automatically learns to solve new long-horizon,
complex, and meaningful tasks by growing a learned skill library with minimal supervision …

被引用次数：60 相关文章所有 7 个版本

[PDF] arxiv.org

Kite: Keypoint-conditioned policies for semantic manipulation

P Sundaresan, S Belkhale, D Sadigh… - arXiv preprint arXiv …, 2023 - arxiv.org

While natural language offers a convenient shared interface for humans and robots,
enabling robots to interpret and follow language commands remains a longstanding …

被引用次数：22 相关文章所有 7 个版本

[PDF] neurips.cc

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc

In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

被引用次数：7 相关文章所有 5 个版本

[PDF] arxiv.org

Simple hierarchical planning with diffusion

C Chen, F Deng, K Kawaguchi, C Gulcehre… - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion-based generative methods have proven effective in modeling trajectories with
offline datasets. However, they often face computational challenges and can falter in …

被引用次数：17 相关文章所有 5 个版本

[PDF] neurips.cc

DiffVL: scaling up soft body manipulation using vision-language driven differentiable physics

Z Huang, F Chen, Y Pu, C Lin… - Advances in Neural …, 2023 - proceedings.neurips.cc

Combining gradient-based trajectory optimization with differentiable physics simulation is an
efficient technique for solving soft-body manipulation problems. Using a well-crafted …

被引用次数：3 相关文章所有 7 个版本

[PDF] arxiv.org

Model-based runtime monitoring with interactive imitation learning

H Liu, S Dass, R Martín-Martín… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

Robot learning methods have recently made great strides, but generalization and
robustness challenges still hinder their widespread deployment. Failing to detect and …

被引用次数：11 相关文章所有 2 个版本

[PDF] openreview.net

Pre-training goal-based models for sample-efficient reinforcement learning

H Yuan, Z Mu, F Xie, Z Lu - The Twelfth International Conference on …, 2024 - openreview.net

Pre-training on task-agnostic large datasets is a promising approach for enhancing the
sample efficiency of reinforcement learning (RL) in solving complex tasks. We present …

被引用次数：7 相关文章

[PDF] arxiv.org

Dexdeform: Dexterous deformable object manipulation with human demonstrations and differentiable physics

S Li, Z Huang, T Chen, T Du, H Su… - arXiv preprint arXiv …, 2023 - arxiv.org

In this work, we aim to learn dexterous manipulation of deformable objects using multi-
fingered hands. Reinforcement learning approaches for dexterous rigid object manipulation …

被引用次数：18 相关文章所有 3 个版本

高级搜索

QQ 群