N Vieillard, T Kozuno, B Scherrer… - Advances in …, 2020 - proceedings.neurips.cc
… (RL) algorithms making use of KullbackLeibler (KL) regularization as a core component … about why KL regularization helps, so far. We study KL regularization within an approximate …
Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - arxiv.org
… We focus on behavior regularized approaches applied to … We introduce behavior regularized actor critic (BRAC), a general … variants of the behavior regularized approach can yield good …
E Derman, S Mannor - arXiv preprint arXiv:2003.02894, 2020 - arxiv.org
… between robustness and regularization in classification and … why regularization generalizes well in statistical learning. … , existing regularization methods in reinforcementlearning (…
… objectives, eg weight regularization, noise injection [28], or … observations, as well as regularizing the Q-function learned by … two simple mechanisms for regularizing the value function …
K Wang, B Kang, J Shao… - Advances in Neural …, 2020 - proceedings.neurips.cc
… 3.2 Mixture regularization Inspired by the success of mixup in supervised learning [32], we introduce mixture regularization (mixreg) to increase the diversity of limited training data and …
… Despite regularization being largely underutilized in deep reinforcementlearning, we show … deep neural networks to improve robustness and adaptability, we apply regularization …
… regularized using the same tools as in supervised learning (eg, weight decay, dropout), we will use “regularization… Such regularization methods include policy regularization (penalizing …
… In this work, we propose a policy gradient algorithm, CORERL (COntrol REgularized ReinforcementLearning), that utilizes a functional regularizer around a, typically suboptimal, control …
We propose a simple data augmentation technique that can be applied to standard model-free reinforcementlearning algorithms, enabling robust learning directly from pixels without …