相关文章- 学术资源搜索

NeoRL: A near real-world benchmark for offline reinforcement learning

RJ Qin, X Zhang, S Gao, XH Chen… - Advances in …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) aims at learning effective policies from historical data
without extra environment interactions. During our experience of applying offline RL, we …

被引用次数：70 相关文章所有 6 个版本

[PDF] neurips.cc

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

被引用次数：643 相关文章所有 6 个版本

[PDF] neurips.cc

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc

Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

被引用次数：301 相关文章所有 9 个版本

[PDF] aaai.org

Weighted policy constraints for offline reinforcement learning

Z Peng, C Han, Y Liu, Z Zhou - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org

Offline reinforcement learning (RL) aims to learn policy from the passively collected offline
dataset. Applying existing RL methods on the static dataset straightforwardly will raise …

被引用次数：11 相关文章所有 2 个版本

[PDF] neurips.cc

Data-efficient pipeline for offline reinforcement learning with limited data

A Nie, Y Flet-Berliac, D Jordan… - Advances in …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) can be used to improve future performance by
leveraging historical data. There exist many different algorithms for offline RL, and it is well …

被引用次数：14 相关文章所有 8 个版本

[PDF] neurips.cc

A closer look at offline rl agents

Y Fu, D Wu, B Boulet - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Despite recent advances in the field of Offline Reinforcement Learning (RL), less attention
has been paid to understanding the behaviors of learned RL agents. As a result, there …

被引用次数：16 相关文章所有 4 个版本

[PDF] mlr.press

Showing your offline reinforcement learning work: Online evaluation budget matters

V Kurenkov, S Kolesnikov - International Conference on …, 2022 - proceedings.mlr.press

In this work, we argue for the importance of an online evaluation budget for a reliable
comparison of deep offline RL algorithms. First, we delineate that the online evaluation …

被引用次数：23 相关文章所有 4 个版本

[PDF] ieee.org

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

被引用次数：220 相关文章所有 9 个版本

[PDF] neurips.cc

Morel: Model-based offline reinforcement learning

R Kidambi, A Rajeswaran… - Advances in neural …, 2020 - proceedings.neurips.cc

In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based
solely on a dataset of historical interactions with the environment. This serves as an extreme …

被引用次数：654 相关文章所有 7 个版本

[PDF] neurips.cc

Beyond uniform sampling: Offline reinforcement learning with imbalanced datasets

ZW Hong, A Kumar, S Karnik… - Advances in …, 2023 - proceedings.neurips.cc

Offline reinforcement learning (RL) enables learning a decision-making policy without
interaction with the environment. This makes it particularly beneficial in situations where …

被引用次数：6 相关文章所有 6 个版本

高级搜索

QQ 群

NeoRL: A near real-world benchmark for offline reinforcement learning

A minimalist approach to offline reinforcement learning

Critic regularized regression

Weighted policy constraints for offline reinforcement learning

Data-efficient pipeline for offline reinforcement learning with limited data

A closer look at offline rl agents

Showing your offline reinforcement learning work: Online evaluation budget matters

A survey on offline reinforcement learning: Taxonomy, review, and open problems

Morel: Model-based offline reinforcement learning

Beyond uniform sampling: Offline reinforcement learning with imbalanced datasets

相关搜索

引用