Hiql: Offline goal-conditioned rl with latent states as actions

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press
Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

Bootstrap your own skills: Learning to solve new tasks with large language model guidance

J Zhang, J Zhang, K Pertsch, Z Liu, X Ren… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose BOSS, an approach that automatically learns to solve new long-horizon,
complex, and meaningful tasks by growing a learned skill library with minimal supervision …

Kite: Keypoint-conditioned policies for semantic manipulation

P Sundaresan, S Belkhale, D Sadigh… - arXiv preprint arXiv …, 2023 - arxiv.org
While natural language offers a convenient shared interface for humans and robots,
enabling robots to interpret and follow language commands remains a longstanding …

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc
In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

Simple hierarchical planning with diffusion

C Chen, F Deng, K Kawaguchi, C Gulcehre… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion-based generative methods have proven effective in modeling trajectories with
offline datasets. However, they often face computational challenges and can falter in …

DiffVL: scaling up soft body manipulation using vision-language driven differentiable physics

Z Huang, F Chen, Y Pu, C Lin… - Advances in Neural …, 2023 - proceedings.neurips.cc
Combining gradient-based trajectory optimization with differentiable physics simulation is an
efficient technique for solving soft-body manipulation problems. Using a well-crafted …

Model-based runtime monitoring with interactive imitation learning

H Liu, S Dass, R Martín-Martín… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Robot learning methods have recently made great strides, but generalization and
robustness challenges still hinder their widespread deployment. Failing to detect and …

Pre-training goal-based models for sample-efficient reinforcement learning

H Yuan, Z Mu, F Xie, Z Lu - The Twelfth International Conference on …, 2024 - openreview.net
Pre-training on task-agnostic large datasets is a promising approach for enhancing the
sample efficiency of reinforcement learning (RL) in solving complex tasks. We present …

Dexdeform: Dexterous deformable object manipulation with human demonstrations and differentiable physics

S Li, Z Huang, T Chen, T Du, H Su… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we aim to learn dexterous manipulation of deformable objects using multi-
fingered hands. Reinforcement learning approaches for dexterous rigid object manipulation …