Hyperparameters in reinforcement learning and how to tune them

T Eimer, M Lindauer… - … Conference on Machine …, 2023 - proceedings.mlr.press
In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

Sample-efficient automated deep reinforcement learning

JKH Franke, G Köhler, A Biedenkapp… - arXiv preprint arXiv …, 2020 - arxiv.org
Despite significant progress in challenging problems across various domains, applying state-
of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their …

Benchmarking model-based reinforcement learning

T Wang, X Bao, I Clavera, J Hoang, Y Wen… - arXiv preprint arXiv …, 2019 - arxiv.org
Model-based reinforcement learning (MBRL) is widely seen as having the potential to be
significantly more sample efficient than model-free RL. However, research in model-based …

On inductive biases in deep reinforcement learning

M Hessel, H van Hasselt, J Modayil, D Silver - arXiv preprint arXiv …, 2019 - arxiv.org
Many deep reinforcement learning algorithms contain inductive biases that sculpt the
agent's objective and its interface to the environment. These inductive biases can take many …

Mushroomrl: Simplifying reinforcement learning research

C D'Eramo, D Tateo, A Bonarini, M Restelli… - Journal of Machine …, 2021 - jmlr.org
MushroomRL is an open-source Python library developed to simplify the process of
implementing and running Reinforcement Learning (RL) experiments. Compared to other …

Deep reinforcement learning with robust and smooth policy

Q Shen, Y Li, H Jiang, Z Wang… - … Conference on Machine …, 2020 - proceedings.mlr.press
Deep reinforcement learning (RL) has achieved great empirical successes in various
domains. However, the large search space of neural networks requires a large amount of …

Improving generalization in reinforcement learning with mixture regularization

K Wang, B Kang, J Shao… - Advances in Neural …, 2020 - proceedings.neurips.cc
Deep reinforcement learning (RL) agents trained in a limited set of environments tend to
suffer overfitting and fail to generalize to unseen testing environments. To improve their …

Model-based or model-free, a review of approaches in reinforcement learning

Q Huang - 2020 International Conference on Computing and …, 2020 - ieeexplore.ieee.org
Reinforcement learning (RL) algorithms can successfully solve a wide range of problems
that we faced. Because of the Alpha Go against KeJie in 2017, the topic of RL has reached …

Assessing generalization in deep reinforcement learning

C Packer, K Gao, J Kos, P Krähenbühl, V Koltun… - arXiv preprint arXiv …, 2018 - arxiv.org
Deep reinforcement learning (RL) has achieved breakthrough results on many tasks, but
agents often fail to generalize beyond the environment they were trained in. As a result …

A general framework for sample-efficient function approximation in reinforcement learning

Z Chen, CJ Li, A Yuan, Q Gu, MI Jordan - arXiv preprint arXiv:2209.15634, 2022 - arxiv.org
With the increasing need for handling large state and action spaces, general function
approximation has become a key technique in reinforcement learning (RL). In this paper, we …