相关文章- 学术资源搜索

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

被引用次数：212 相关文章所有 7 个版本

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

被引用次数：1738 相关文章所有 3 个版本

[PDF] neurips.cc

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：113 相关文章所有 7 个版本

[PDF] qcloudimg.com

[PDF][PDF] Rl unplugged: Benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - arXiv preprint arXiv …, 2020 - ask.qcloudimg.com

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：54 相关文章

[PDF] arxiv.org

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

D Yarats, D Brandfonbrener, H Liu, M Laskin… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent progress in deep learning has relied on access to large and diverse datasets. Such
data-driven progress has been less evident in offline reinforcement learning (RL), because …

被引用次数：82 相关文章所有 3 个版本

[PDF] arxiv.org

D4rl: Datasets for deep data-driven reinforcement learning

J Fu, A Kumar, O Nachum, G Tucker… - arXiv preprint arXiv …, 2020 - arxiv.org

The offline reinforcement learning (RL) setting (also known as full batch RL), where a policy
is learned from a static dataset, is compelling as progress enables RL methods to take …

被引用次数：980 相关文章所有 3 个版本

[PDF] arxiv.org

Awac: Accelerating online reinforcement learning with offline datasets

A Nair, A Gupta, M Dalal, S Levine - arXiv preprint arXiv:2006.09359, 2020 - arxiv.org

Reinforcement learning (RL) provides an appealing formalism for learning control policies
from experience. However, the classic active formulation of RL necessitates a lengthy active …

被引用次数：481 相关文章所有 8 个版本

[PDF] arxiv.org

Behavior regularized offline reinforcement learning

Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - arxiv.org

In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

被引用次数：688 相关文章所有 5 个版本

[PDF] neurips.cc

NeoRL: A near real-world benchmark for offline reinforcement learning

RJ Qin, X Zhang, S Gao, XH Chen… - Advances in …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) aims at learning effective policies from historical data
without extra environment interactions. During our experience of applying offline RL, we …

被引用次数：68 相关文章所有 5 个版本

[PDF] mlr.press

Efficient online reinforcement learning with offline data

PJ Ball, L Smith, I Kostrikov… - … Conference on Machine …, 2023 - proceedings.mlr.press

Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …

被引用次数：66 相关文章所有 6 个版本

高级搜索

QQ 群

A survey on offline reinforcement learning: Taxonomy, review, and open problems

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

Rl unplugged: A suite of benchmarks for offline reinforcement learning

[PDF][PDF] Rl unplugged: Benchmarks for offline reinforcement learning

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

D4rl: Datasets for deep data-driven reinforcement learning

Awac: Accelerating online reinforcement learning with offline datasets

Behavior regularized offline reinforcement learning

NeoRL: A near real-world benchmark for offline reinforcement learning

Efficient online reinforcement learning with offline data

相关搜索

引用