相关文章- 学术资源搜索

D4rl: Datasets for deep data-driven reinforcement learning

J Fu, A Kumar, O Nachum, G Tucker… - arXiv preprint arXiv …, 2020 - arxiv.org

The offline reinforcement learning (RL) setting (also known as full batch RL), where a policy
is learned from a static dataset, is compelling as progress enables RL methods to take …

被引用次数：1000 相关文章所有 3 个版本

[PDF] ieee.org

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

被引用次数：225 相关文章所有 9 个版本

[PDF] arxiv.org

Awac: Accelerating online reinforcement learning with offline datasets

A Nair, A Gupta, M Dalal, S Levine - arXiv preprint arXiv:2006.09359, 2020 - arxiv.org

Reinforcement learning (RL) provides an appealing formalism for learning control policies
from experience. However, the classic active formulation of RL necessitates a lengthy active …

被引用次数：499 相关文章所有 7 个版本

[PDF] arxiv.org

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

D Yarats, D Brandfonbrener, H Liu, M Laskin… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent progress in deep learning has relied on access to large and diverse datasets. Such
data-driven progress has been less evident in offline reinforcement learning (RL), because …

被引用次数：79 相关文章所有 3 个版本

[PDF] arxiv.org

Dopamine: A research framework for deep reinforcement learning

PS Castro, S Moitra, C Gelada, S Kumar… - arXiv preprint arXiv …, 2018 - arxiv.org

Deep reinforcement learning (deep RL) research has grown significantly in recent years. A
number of software offerings now exist that provide stable, comprehensive implementations …

被引用次数：277 相关文章所有 3 个版本

[PDF] arxiv.org

Sample-efficient automated deep reinforcement learning

JKH Franke, G Köhler, A Biedenkapp… - arXiv preprint arXiv …, 2020 - arxiv.org

Despite significant progress in challenging problems across various domains, applying state-
of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their …

被引用次数：40 相关文章所有 7 个版本

[PDF] aaai.org

Deep reinforcement learning that matters

P Henderson, R Islam, P Bachman, J Pineau… - Proceedings of the …, 2018 - ojs.aaai.org

In recent years, significant progress has been made in solving challenging problems across
various domains using deep reinforcement learning (RL). Reproducing existing work and …

被引用次数：2341 相关文章所有 19 个版本

[PDF] arxiv.org

Latent-variable advantage-weighted policy optimization for offline rl

X Chen, A Ghadirzadeh, T Yu, Y Gao, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Offline reinforcement learning methods hold the promise of learning policies from pre-
collected datasets without the need to query the environment for new transitions. This setting …

被引用次数：26 相关文章所有 4 个版本

[PDF] mlr.press

Semi-supervised offline reinforcement learning with action-free trajectories

Q Zheng, M Henaff, B Amos… - … conference on machine …, 2023 - proceedings.mlr.press

Natural agents can effectively learn from multiple data sources that differ in size, quality, and
types of measurements. We study this heterogeneity in the context of offline reinforcement …

被引用次数：17 相关文章所有 8 个版本

[PDF] neurips.cc

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：166 相关文章所有 8 个版本

高级搜索

QQ 群

D4rl: Datasets for deep data-driven reinforcement learning

A survey on offline reinforcement learning: Taxonomy, review, and open problems

Awac: Accelerating online reinforcement learning with offline datasets

Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning

Dopamine: A research framework for deep reinforcement learning

Sample-efficient automated deep reinforcement learning

Deep reinforcement learning that matters

Latent-variable advantage-weighted policy optimization for offline rl

Semi-supervised offline reinforcement learning with action-free trajectories

Rl unplugged: A suite of benchmarks for offline reinforcement learning

相关搜索

引用