相关文章- 学术资源搜索

[PDF][PDF] A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning

J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv… - Proceedings of the 23rd …, 2024 - ifaamas.org

In recent years, offline reinforcement learning (RL) algorithms have gained considerable
attention. However, the role of data sampling techniques in offline RL has been somewhat …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv… - arXiv preprint arXiv …, 2023 - arxiv.org

In recent years, data-driven reinforcement learning (RL), also known as offline RL, have
gained significant attention. However, the role of data sampling techniques in offline RL has …

Improving and benchmarking offline reinforcement learning algorithms

B Kang, X Ma, Y Wang, Y Yue, S Yan - arXiv preprint arXiv:2306.00972, 2023 - arxiv.org

Recently, Offline Reinforcement Learning (RL) has achieved remarkable progress with the
emergence of various algorithms and datasets. However, these methods usually focus on …

被引用次数：2 相关文章所有 2 个版本

[PDF] neurips.cc

Data-efficient pipeline for offline reinforcement learning with limited data

A Nie, Y Flet-Berliac, D Jordan… - Advances in …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) can be used to improve future performance by
leveraging historical data. There exist many different algorithms for offline RL, and it is well …

被引用次数：15 相关文章所有 7 个版本

[PDF] arxiv.org

Offline Trajectory Generalization for Offline Reinforcement Learning

Z Zhao, Z Ren, L Yang, F Yuan, P Ren, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Offline reinforcement learning (RL) aims to learn policies from static datasets of previously
collected trajectories. Existing methods for offline RL either constrain the learned policy to …

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

F Wu, R Zhang, Q Yi, Y Gao, J Guo, S Peng… - Proceedings of the …, 2024 - ojs.aaai.org

Model-based offline reinforcement learning (RL) algorithms have emerged as a promising
paradigm for offline RL. These algorithms usually learn a dynamics model from a static …

[PDF] qcloudimg.com

[PDF][PDF] Rl unplugged: Benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - arXiv preprint arXiv …, 2020 - ask.qcloudimg.com

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：54 相关文章

[PDF] mlr.press

You only evaluate once: a simple baseline algorithm for offline rl

W Goo, S Niekum - Conference on Robot Learning, 2022 - proceedings.mlr.press

The goal of offline reinforcement learning (RL) is to find an optimal policy given prerecorded
trajectories. Many current approaches customize existing off-policy RL algorithms, especially …

被引用次数：14 相关文章所有 6 个版本

[PDF] arxiv.org

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

G Li, Y Shan, Z Zhu, T Long, W Zhang - arXiv preprint arXiv:2402.02439, 2024 - arxiv.org

In offline reinforcement learning (RL), the performance of the learned policy highly depends
on the quality of offline datasets. However, in many cases, the offline dataset contains very …

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：113 相关文章所有 7 个版本

高级搜索

QQ 群

[PDF][PDF] A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

Improving and benchmarking offline reinforcement learning algorithms

Data-efficient pipeline for offline reinforcement learning with limited data

Offline Trajectory Generalization for Offline Reinforcement Learning

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

[PDF][PDF] Rl unplugged: Benchmarks for offline reinforcement learning

You only evaluate once: a simple baseline algorithm for offline rl

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

Rl unplugged: A suite of benchmarks for offline reinforcement learning

引用