The difficulty of passive learning in deep reinforcement learning

G Ostrovski, PS Castro… - Advances in Neural …, 2021 - proceedings.neurips.cc
… paradigm which facilitates our empirical analysis of the difficulties in offline reinforcement
detailed empirical analysis of the failure modes of passive (ie non-interactive, offline) learning, …

How many random seeds? statistical power analysis in deep reinforcement learning experiments

C Colas, O Sigaud, PY Oudeyer - arXiv preprint arXiv:1806.08295, 2018 - arxiv.org
… Reproducibility of benchmarked deep reinforcement learning tasks for continuous control.
In: Proceedings of the ICML 2017 workshop on Reproducibility in Machine Learning (RML). …

An empirical study of representation learning for reinforcement learning in healthcare

TW Killian, H Zhang, J Subramanian, M Fatemi… - arXiv preprint arXiv …, 2020 - arxiv.org
… As we do not have the ability to generate more data through an exploration of novel treatment
strategies, we develop a policy using offline, batch reinforcement learning. In this setting, it …

Using reinforcement learning to validate empirical game-theoretic analysis: A continuous double auction study

M Wright - arXiv preprint arXiv:1604.06710, 2016 - arxiv.org
… I propose using reinforcement learning to analyze the regret … I have developed a new library
of reinforcement learning tools, … Finally, I use our new reinforcement learning tools to provide …

Emphatic algorithms for deep reinforcement learning

R Jiang, T Zahavy, Z Xu, A White… - … Machine Learning, 2021 - proceedings.mlr.press
… We empirically analyze the properties of these new emphatic algorithms to observe how
qualitative properties such as convergence, learning speed and variance manifest in practice. …

An empirical study of crop yield prediction using reinforcement learning

MP Vaishnnave, R Manivannan - … Intelligent Techniques for …, 2022 - Wiley Online Library
… More specifically, reinforcement learning methods, including multiple regressions, random …
crucial aspect of machine learning models. In addition, reinforcement learning is analogous to …

Leverage the average: an analysis of kl regularization in reinforcement learning

N Vieillard, T Kozuno, B Scherrer… - Advances in …, 2020 - proceedings.neurips.cc
… evaluation for the analysis, but we will compare both approaches empirically. Now, we …
the broader impact of our contribution to be the same as the one of reinforcement learning. …

A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc
… To identify the effect of overfitting in independent reinforcement learners, we introduce joint
policy correlation (JPC) matrices. To simplify the presentation, we describe here the special …

An analysis of categorical distributional reinforcement learning

M Rowland, M Bellemare, W Dabney… - International …, 2018 - proceedings.mlr.press
Reinforcement learning (RL) formalises the problems of evaluation and optimisation of
an … We refer to these approaches as categorical distributional reinforcement learning (CDRL). …

Steady state analysis of episodic reinforcement learning

H Bojun - Advances in Neural Information Processing …, 2020 - proceedings.neurips.cc
… This work mostly looked at a theoretical foundation of reinforcement learning, which, in
the author’s view, contributes to understand RL as a general and natural phenomenon of the …