On the generalization gap in reparameterizable reinforcement learning

H Wang, S Zheng, C Xiong… - … on Machine Learning, 2019 - proceedings.mlr.press
… in reinforcement learning remain unsolved. One that draws more and more attention is the
issue of overfitting in reinforcement learning (Sutton… gap in deep reinforcement learning. They …

Kernel-based reinforcement learning: A finite-time analysis

OD Domingues, P Ménard, M Pirotta… - … Machine Learning, 2021 - proceedings.mlr.press
… We consider the exploration-exploitation dilemma in finite-horizon reinforcement learning
problems whose state-action space is endowed with a metric. We introduce Kernel-UCBVI, a …

Deep reinforcement learning: A survey

X Wang, S Wang, X Liang, D Zhao… - … and Learning …, 2022 - ieeexplore.ieee.org
… In addition, deep learning has stimulated the further development of many … reinforcement
learning, such as hierarchical reinforcement learning (HRL), multiagent reinforcement learning, …

An empirical study of ddpg and ppo-based reinforcement learning algorithms for autonomous driving

S Siboo, A Bhattacharyya, RN Raj, SH Ashwin - IEEE Access, 2023 - ieeexplore.ieee.org
… To make it more efficient in dealing with high dimensional sensory data, Deep Reinforcement
Learning (DRL) with continuous state and action spaces has been developed. The usual …

SEADS Scalable and Cost-effective Dynamic Dependence Analysis of Distributed Systems via Reinforcement Learning

X Fu, H Cai, W Li, L Li - ACM Transactions on Software Engineering and …, 2020 - dl.acm.org
analysis. At the core of the automatic adjustment is our application of a reinforcement learning
… to the current configuration and its associated analysis cost with respect to the user budget…

A regularized approach to sparse optimal policy in reinforcement learning

W Yang, X Li, Z Zhang - Advances in Neural Information …, 2019 - proceedings.neurips.cc
… are potentially useful in reinforcement learning. In particular, … We also conduct a full
mathematical analysis of the proposed … We empirically analyze the numerical properties of optimal …

Online reinforcement learning in stochastic games

CY Wei, YT Hong, CJ Lu - Advances in Neural Information …, 2017 - proceedings.neurips.cc
… We study online reinforcement learning in average-reward stochastic games (SGs). An SG
models a two-player zero-sum game in a Markov environment, where state transitions and …

Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - … Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
… Abstract—Reinforcement learning is a learningreinforcement learning upon the fast
development of deep neural networks. Along with the promising prospects of reinforcement learning

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
… 2 Single-agent learning This section presents the formalism of reinforcement learning and
its main components before outlining deep reinforcement learning along with its particular …

Evaluating critical reinforcement learning framework in the field

S Ju, G Zhou, M Abdelshiheed, T Barnes… - … conference on artificial …, 2021 - Springer
Reinforcement Learning (RL) is learning what action to take next by mapping situations to
actions so as to maximize cumulative rewards. In recent years RL has achieved great success …