A workflow for offline model-free robotic reinforcement learning

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

被引用次数：229 相关文章所有 9 个版本

[PDF] royalsocietypublishing.org Full View

Learning robotic navigation from experience: principles, methods and recent results

S Levine, D Shah - … Transactions of the Royal Society B, 2023 - royalsocietypublishing.org

Navigation is one of the most heavily studied problems in robotics and is conventionally
approached as a geometric mapping and planning problem. However, real-world navigation …

被引用次数：16 相关文章所有 8 个版本

[PDF] arxiv.org

Is conditional generative modeling all you need for decision-making?

A Ajay, Y Du, A Gupta, J Tenenbaum… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent improvements in conditional generative modeling have made it possible to generate
high-quality images from language descriptions alone. We investigate whether these …

被引用次数：234 相关文章所有 4 个版本

[PDF] thecvf.com

Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

被引用次数：67 相关文章所有 9 个版本

[PDF] mlr.press

Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023 - proceedings.mlr.press

In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …

被引用次数：44 相关文章所有 6 个版本

[PDF] arxiv.org

Vip: Towards universal visual reward and representation via value-implicit pre-training

YJ Ma, S Sodhani, D Jayaraman, O Bastani… - arXiv preprint arXiv …, 2022 - arxiv.org

Reward and representation learning are two long-standing challenges for learning an
expanding set of robot manipulation skills from sensory observations. Given the inherent …

被引用次数：164 相关文章所有 5 个版本

[PDF] neurips.cc

Rambo-rl: Robust adversarial model-based offline reinforcement learning

M Rigter, B Lacerda, N Hawes - Advances in neural …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) aims to find performant policies from logged data without
further environment interaction. Model-based algorithms, which learn a model of the …

被引用次数：97 相关文章所有 7 个版本

[PDF] neurips.cc

CORL: Research-oriented deep offline reinforcement learning library

D Tarasov, A Nikulin, D Akimov… - Advances in …, 2024 - proceedings.neurips.cc

CORL is an open-source library that provides thoroughly benchmarked single-file
implementations of both deep offline and offline-to-online reinforcement learning algorithms …

被引用次数：52 相关文章所有 6 个版本

[PDF] mlr.press

Q-learning decision transformer: Leveraging dynamic programming for conditional sequence modelling in offline rl

T Yamagata, A Khalil… - … on Machine Learning, 2023 - proceedings.mlr.press

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional
policy produces promising results. The Decision Transformer (DT) combines the conditional …

被引用次数：50 相关文章所有 9 个版本

[PDF] mlr.press

Offline rl policies should be trained to be adaptive

D Ghosh, A Ajay, P Agrawal… - … Conference on Machine …, 2022 - proceedings.mlr.press

Offline RL algorithms must account for the fact that the dataset they are provided may leave
many facets of the environment unknown. The most common way to approach this challenge …

被引用次数：38 相关文章所有 5 个版本

高级搜索

QQ 群