NeoRL: A near real-world benchmark for offline reinforcement learning

RJ Qin, X Zhang, S Gao, XH Chen… - Advances in …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) aims at learning effective policies from historical data
without extra environment interactions. During our experience of applying offline RL, we …

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Weighted policy constraints for offline reinforcement learning

Z Peng, C Han, Y Liu, Z Zhou - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
Offline reinforcement learning (RL) aims to learn policy from the passively collected offline
dataset. Applying existing RL methods on the static dataset straightforwardly will raise …

Data-efficient pipeline for offline reinforcement learning with limited data

A Nie, Y Flet-Berliac, D Jordan… - Advances in …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) can be used to improve future performance by
leveraging historical data. There exist many different algorithms for offline RL, and it is well …

A closer look at offline rl agents

Y Fu, D Wu, B Boulet - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Despite recent advances in the field of Offline Reinforcement Learning (RL), less attention
has been paid to understanding the behaviors of learned RL agents. As a result, there …

Showing your offline reinforcement learning work: Online evaluation budget matters

V Kurenkov, S Kolesnikov - International Conference on …, 2022 - proceedings.mlr.press
In this work, we argue for the importance of an online evaluation budget for a reliable
comparison of deep offline RL algorithms. First, we delineate that the online evaluation …

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

Morel: Model-based offline reinforcement learning

R Kidambi, A Rajeswaran… - Advances in neural …, 2020 - proceedings.neurips.cc
In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based
solely on a dataset of historical interactions with the environment. This serves as an extreme …

Beyond uniform sampling: Offline reinforcement learning with imbalanced datasets

ZW Hong, A Kumar, S Karnik… - Advances in …, 2023 - proceedings.neurips.cc
Offline reinforcement learning (RL) enables learning a decision-making policy without
interaction with the environment. This makes it particularly beneficial in situations where …