Moto: Offline pre-training to online fine-tuning for model-based robot learning

V Kolev, R Rafailov, K Hatch, J Wu, C Finn - arXiv preprint arXiv …, 2024 - arxiv.org

We tackle the problem of policy learning from expert demonstrations without a reward
function. A central challenge in this space is that these policies fail upon deployment due to …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation

N Hirose, D Shah, K Stachowicz, A Sridhar… - arXiv preprint arXiv …, 2024 - arxiv.org

Autonomous self-improving robots that interact and improve with experience are key to the
real-world deployment of robotic systems. In this paper, we propose an online learning …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning

XH Liu, TS Liu, S Jiang, R Chen, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Combining offline and online reinforcement learning (RL) techniques is indeed crucial for
achieving efficient and safe learning where data acquisition is expensive. Existing methods …

[PDF] arxiv.org

[PDF] openreview.net

Guided Decoupled Exploration for Offline Reinforcement Learning Fine-tuning

Y Fu, D Wu, B Boulet - openreview.net

Fine-tuning pre-trained offline Reinforcement Learning (RL) agents with online interactions
is a promising strategy to improve the sample efficiency. In this work, we study the problem …

高级搜索

QQ 群