Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

T Yu, D Quillen, Z He, R Julian… - … on robot learning, 2020 - proceedings.mlr.press
… and multitask learning consisting of 50 distinct robotic manipulation tasks. Our aim is to … We
evaluate 6 state-of-the-art meta-reinforcement learning and multi-task learning algorithms on …

Benchmarking deep reinforcement learning for continuous control

Y Duan, X Chen, R Houthooft… - … machine learning, 2016 - proceedings.mlr.press
… In this work, we present a benchmark suite of continuous control tasks, including classic …
evaluation of a range of implemented reinforcement learning algorithms. Both the benchmark

A review for deep reinforcement learning in atari: Benchmarks, challenges, and solutions

J Fan - arXiv preprint arXiv:2112.04145, 2021 - arxiv.org
… This paper will propose more comprehensive and reasonable evaluation metrics for the
Atari benchmark to test the real superhuman reinforcement learning algorithms. Learning

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc
… In this paper, we propose a benchmark called RL Unplugged to evaluate and compare
offline RL methods. RL Unplugged includes data from a diverse range of domains including …

Benchmarking reinforcement learning algorithms on real-world robots

AR Mahmood, D Korenkevych… - … on robot learning, 2018 - proceedings.mlr.press
… , we introduce six reinforcement learning tasks based on three … benchmark four
reinforcement learning algorithms for continuous control: TRPO, PPO, DDPG, and Soft Q-learning

Benchmarking batch deep reinforcement learning algorithms

S Fujimoto, E Conti, M Ghavamzadeh… - arXiv preprint arXiv …, 2019 - arxiv.org
… algorithms have been shown to fail in the batch setting–learning from a fixed data set … In
this paper, we benchmark the performance of recent off-policy and batch reinforcement learning

Natural environment benchmarks for reinforcement learning

A Zhang, Y Wu, J Pineau - arXiv preprint arXiv:1811.06032, 2018 - arxiv.org
… While current benchmark reinforcement learning (RL) tasks … in many ways poor substitutes
for learning with real-world data. By … Most benchmarks and datasets used to evaluate machine …

NeoRL: A near real-world benchmark for offline reinforcement learning

RJ Qin, X Zhang, S Gao, XH Chen… - Advances in …, 2022 - proceedings.neurips.cc
… We then evaluate recent state-of-the-art offline RL algorithms on NeoRL. The empirical … in
the previous benchmarks. We also disclose that current offline policy evaluation methods could …

Urlb: Unsupervised reinforcement learning benchmark

M Laskin, D Yarats, H Liu, K Lee, A Zhan, K Lu… - arXiv preprint arXiv …, 2021 - arxiv.org
Reinforcement Learning setting where an agent’s interaction with the environment is modeled
through a Markov Decision Process (MDP) [63]. In this work, we benchmark … to evaluate a …

Benchmarking multi-agent deep reinforcement learning algorithms in cooperative tasks

G Papoudakis, F Christianos, L Schäfer… - arXiv preprint arXiv …, 2020 - arxiv.org
… We evaluate algorithms in two matrix games and four multi-agent environments, in … benchmark
results, we analyse and discuss insights regarding the effectiveness of different learning