Hyperparameter selection for offline reinforcement learning

TL Paine, C Paduraru, A Michi, C Gulcehre… - arXiv preprint arXiv …, 2020 - arxiv.org
Offline reinforcement learning (RL purely from logged data) is an important avenue for
deploying RL techniques in real-world scenarios. However, existing hyperparameter …

Hyperparameter Selection for Offline Reinforcement Learning

T Le Paine, C Paduraru, A Michi, C Gulcehre… - arXiv e …, 2020 - ui.adsabs.harvard.edu
Offline reinforcement learning (RL purely from logged data) is an important avenue for
deploying RL techniques in real-world scenarios. However, existing hyperparameter …