Watch and match: Supercharging imitation with regularized optimal transport

S Haldar, V Mathur, D Yarats… - Conference on Robot …, 2023 - proceedings.mlr.press
Imitation learning holds tremendous promise in learning policies efficiently for complex
decision making problems. Current state-of-the-art algorithms often use inverse …

For pre-trained vision models in motor control, not all policy learning methods are created equal

Y Hu, R Wang, LE Li, Y Gao - International Conference on …, 2023 - proceedings.mlr.press
In recent years, increasing attention has been directed to leveraging pre-trained vision
models for motor control. While existing works mainly emphasize the importance of this pre …

Can pre-trained text-to-image models generate visual goals for reinforcement learning?

J Gao, K Hu, G Xu, H Xu - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Pre-trained text-to-image generative models can produce diverse, semantically rich, and
realistic images from natural language descriptions. Compared with language, images …

Cross-domain imitation learning via optimal transport

A Fickinger, S Cohen, S Russell, B Amos - arXiv preprint arXiv:2110.03684, 2021 - arxiv.org
Cross-domain imitation learning studies how to leverage expert demonstrations of one
agent to train an imitation agent with a different embodiment or morphology. Comparing …

Policy contrastive imitation learning

J Huang, ZH Yin, Y Hu, Y Gao - International Conference on …, 2023 - proceedings.mlr.press
Adversarial imitation learning (AIL) is a popular method that has recently achieved much
success. However, the performance of AIL is still unsatisfactory on the more challenging …

What Matters to You? Towards Visual Representation Alignment for Robot Learning

R Tian, C Xu, M Tomizuka, J Malik, A Bajcsy - arXiv preprint arXiv …, 2023 - arxiv.org
When operating in service of people, robots need to optimize rewards aligned with end-user
preferences. Since robots will rely on raw perceptual inputs like RGB images, their rewards …

Imitation learning from observation with automatic discount scheduling

Y Liu, W Dong, Y Hu, C Wen, ZH Yin, C Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Humans often acquire new skills through observation and imitation. For robotic agents,
learning from the plethora of unlabeled video demonstration data available on the Internet …

Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments

M Zare, PM Kebria, A Khosravi - arXiv preprint arXiv:2310.08841, 2023 - arxiv.org
Most Reinforcement Learning (RL) methods are traditionally studied in an active learning
setting, where agents directly interact with their environments, observe action outcomes, and …

Imitation learning from pixel observations for continuous control

S Cohen, B Amos, MP Deisenroth, M Henaff, E Vinitsky… - 2021 - openreview.net
We study imitation learning using only visual observations for controlling dynamical systems
with continuous states and actions. This setting is attractive due to the large amount of video …

Robust Visual Imitation Learning with Inverse Dynamics Representations

S Li, X Wang, R Zuo, K Sun, L Cui, J Ding… - Proceedings of the …, 2024 - ojs.aaai.org
Imitation learning (IL) has achieved considerable success in solving complex sequential
decision-making problems. However, current IL methods mainly assume that the …