Imitation learning with sinkhorn distances

S Haldar, V Mathur, D Yarats… - Conference on Robot …, 2023 - proceedings.mlr.press

Imitation learning holds tremendous promise in learning policies efficiently for complex
decision making problems. Current state-of-the-art algorithms often use inverse …

被引用次数：40 相关文章所有 7 个版本

[PDF] mlr.press

For pre-trained vision models in motor control, not all policy learning methods are created equal

Y Hu, R Wang, LE Li, Y Gao - International Conference on …, 2023 - proceedings.mlr.press

In recent years, increasing attention has been directed to leveraging pre-trained vision
models for motor control. While existing works mainly emphasize the importance of this pre …

被引用次数：13 相关文章所有 6 个版本

[PDF] neurips.cc

Can pre-trained text-to-image models generate visual goals for reinforcement learning?

J Gao, K Hu, G Xu, H Xu - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Pre-trained text-to-image generative models can produce diverse, semantically rich, and
realistic images from natural language descriptions. Compared with language, images …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Cross-domain imitation learning via optimal transport

A Fickinger, S Cohen, S Russell, B Amos - arXiv preprint arXiv:2110.03684, 2021 - arxiv.org

Cross-domain imitation learning studies how to leverage expert demonstrations of one
agent to train an imitation agent with a different embodiment or morphology. Comparing …

被引用次数：41 相关文章所有 4 个版本

[PDF] mlr.press

Policy contrastive imitation learning

J Huang, ZH Yin, Y Hu, Y Gao - International Conference on …, 2023 - proceedings.mlr.press

Adversarial imitation learning (AIL) is a popular method that has recently achieved much
success. However, the performance of AIL is still unsatisfactory on the more challenging …

被引用次数：4 相关文章所有 8 个版本

[PDF] arxiv.org

What Matters to You? Towards Visual Representation Alignment for Robot Learning

R Tian, C Xu, M Tomizuka, J Malik, A Bajcsy - arXiv preprint arXiv …, 2023 - arxiv.org

When operating in service of people, robots need to optimize rewards aligned with end-user
preferences. Since robots will rely on raw perceptual inputs like RGB images, their rewards …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Imitation learning from observation with automatic discount scheduling

Y Liu, W Dong, Y Hu, C Wen, ZH Yin, C Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Humans often acquire new skills through observation and imitation. For robotic agents,
learning from the plethora of unlabeled video demonstration data available on the Internet …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments

M Zare, PM Kebria, A Khosravi - arXiv preprint arXiv:2310.08841, 2023 - arxiv.org

Most Reinforcement Learning (RL) methods are traditionally studied in an active learning
setting, where agents directly interact with their environments, observe action outcomes, and …

被引用次数：1 相关文章所有 3 个版本

[PDF] openreview.net

Imitation learning from pixel observations for continuous control

S Cohen, B Amos, MP Deisenroth, M Henaff, E Vinitsky… - 2021 - openreview.net

We study imitation learning using only visual observations for controlling dynamical systems
with continuous states and actions. This setting is attractive due to the large amount of video …

被引用次数：10 相关文章所有 2 个版本

[PDF] aaai.org

Robust Visual Imitation Learning with Inverse Dynamics Representations

S Li, X Wang, R Zuo, K Sun, L Cui, J Ding… - Proceedings of the …, 2024 - ojs.aaai.org

Imitation learning (IL) has achieved considerable success in solving complex sequential
decision-making problems. However, current IL methods mainly assume that the …

高级搜索

QQ 群