[PDF][PDF] A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning

J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv… - Proceedings of the 23rd …, 2024 - ifaamas.org
In recent years, offline reinforcement learning (RL) algorithms have gained considerable
attention. However, the role of data sampling techniques in offline RL has been somewhat …

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, data-driven reinforcement learning (RL), also known as offline RL, have
gained significant attention. However, the role of data sampling techniques in offline RL has …

Improving and benchmarking offline reinforcement learning algorithms

B Kang, X Ma, Y Wang, Y Yue, S Yan - arXiv preprint arXiv:2306.00972, 2023 - arxiv.org
Recently, Offline Reinforcement Learning (RL) has achieved remarkable progress with the
emergence of various algorithms and datasets. However, these methods usually focus on …

Data-efficient pipeline for offline reinforcement learning with limited data

A Nie, Y Flet-Berliac, D Jordan… - Advances in …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) can be used to improve future performance by
leveraging historical data. There exist many different algorithms for offline RL, and it is well …

Offline Trajectory Generalization for Offline Reinforcement Learning

Z Zhao, Z Ren, L Yang, F Yuan, P Ren, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Offline reinforcement learning (RL) aims to learn policies from static datasets of previously
collected trajectories. Existing methods for offline RL either constrain the learned policy to …

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

F Wu, R Zhang, Q Yi, Y Gao, J Guo, S Peng… - Proceedings of the …, 2024 - ojs.aaai.org
Model-based offline reinforcement learning (RL) algorithms have emerged as a promising
paradigm for offline RL. These algorithms usually learn a dynamics model from a static …

[PDF][PDF] Rl unplugged: Benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - arXiv preprint arXiv …, 2020 - ask.qcloudimg.com
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

You only evaluate once: a simple baseline algorithm for offline rl

W Goo, S Niekum - Conference on Robot Learning, 2022 - proceedings.mlr.press
The goal of offline reinforcement learning (RL) is to find an optimal policy given prerecorded
trajectories. Many current approaches customize existing off-policy RL algorithms, especially …

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

G Li, Y Shan, Z Zhu, T Long, W Zhang - arXiv preprint arXiv:2402.02439, 2024 - arxiv.org
In offline reinforcement learning (RL), the performance of the learned policy highly depends
on the quality of offline datasets. However, in many cases, the offline dataset contains very …

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …