相关文章- 学术资源搜索

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

D Yarats, D Brandfonbrener, H Liu, M Laskin… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent progress in deep learning has relied on access to large and diverse datasets. Such
data-driven progress has been less evident in offline reinforcement learning (RL), because …

被引用次数：85 相关文章所有 3 个版本

[PDF] neurips.cc

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：171 相关文章所有 8 个版本

[PDF] arxiv.org

Hyperparameter selection for offline reinforcement learning

TL Paine, C Paduraru, A Michi, C Gulcehre… - arXiv preprint arXiv …, 2020 - arxiv.org

Offline reinforcement learning (RL purely from logged data) is an important avenue for
deploying RL techniques in real-world scenarios. However, existing hyperparameter …

被引用次数：153 相关文章所有 2 个版本

[PDF] ieee.org

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

被引用次数：248 相关文章所有 9 个版本

[PDF] neurips.cc

Data-efficient pipeline for offline reinforcement learning with limited data

A Nie, Y Flet-Berliac, D Jordan… - Advances in …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) can be used to improve future performance by
leveraging historical data. There exist many different algorithms for offline RL, and it is well …

被引用次数：15 相关文章所有 8 个版本

[PDF] neurips.cc

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

被引用次数：698 相关文章所有 6 个版本

[PDF] arxiv.org

The challenges of exploration for offline reinforcement learning

N Lambert, M Wulfmeier, W Whitney, A Byravan… - arXiv preprint arXiv …, 2022 - arxiv.org

Offline Reinforcement Learning (ORL) enablesus to separately study the two interlinked
processes of reinforcement learning: collecting informative experience and inferring optimal …

被引用次数：38 相关文章所有 4 个版本

[PDF] neurips.cc

Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

被引用次数：79 相关文章所有 6 个版本

[PDF] neurips.cc

Survival instinct in offline reinforcement learning

A Li, D Misra, A Kolobov… - Advances in neural …, 2024 - proceedings.neurips.cc

We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …

被引用次数：12 相关文章所有 5 个版本

[PDF] arxiv.org

D4rl: Datasets for deep data-driven reinforcement learning

J Fu, A Kumar, O Nachum, G Tucker… - arXiv preprint arXiv …, 2020 - arxiv.org

The offline reinforcement learning (RL) setting (also known as full batch RL), where a policy
is learned from a static dataset, is compelling as progress enables RL methods to take …

被引用次数：1048 相关文章所有 3 个版本

高级搜索

QQ 群

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

Rl unplugged: A suite of benchmarks for offline reinforcement learning

Hyperparameter selection for offline reinforcement learning

A survey on offline reinforcement learning: Taxonomy, review, and open problems

Data-efficient pipeline for offline reinforcement learning with limited data

A minimalist approach to offline reinforcement learning

The challenges of exploration for offline reinforcement learning

Conservative data sharing for multi-task offline reinforcement learning

Survival instinct in offline reinforcement learning

D4rl: Datasets for deep data-driven reinforcement learning

相关搜索

引用