reinforcement learning empirical analysis- 学术资源搜索

On the generalization gap in reparameterizable reinforcement learning

H Wang, S Zheng, C Xiong… - … on Machine Learning, 2019 - proceedings.mlr.press

… in reinforcement learning remain unsolved. One that draws more and more attention is the
issue of overfitting in reinforcement learning (Sutton… gap in deep reinforcement learning. They …

被引用次数：37 相关文章所有 6 个版本

[PDF] mlr.press

Kernel-based reinforcement learning: A finite-time analysis

OD Domingues, P Ménard, M Pirotta… - … Machine Learning, 2021 - proceedings.mlr.press

… We consider the exploration-exploitation dilemma in finite-horizon reinforcement learning
problems whose state-action space is endowed with a metric. We introduce Kernel-UCBVI, a …

被引用次数：23 相关文章所有 14 个版本

Deep reinforcement learning: A survey

X Wang, S Wang, X Liang, D Zhao… - … and Learning …, 2022 - ieeexplore.ieee.org

… In addition, deep learning has stimulated the further development of many … reinforcement
learning, such as hierarchical reinforcement learning (HRL), multiagent reinforcement learning, …

被引用次数：204 相关文章所有 3 个版本

[PDF] ieee.org

An empirical study of ddpg and ppo-based reinforcement learning algorithms for autonomous driving

S Siboo, A Bhattacharyya, RN Raj, SH Ashwin - IEEE Access, 2023 - ieeexplore.ieee.org

… To make it more efficient in dealing with high dimensional sensory data, Deep Reinforcement
Learning (DRL) with continuous state and action spaces has been developed. The usual …

被引用次数：6 相关文章所有 3 个版本

[PDF] acm.org

SEADS Scalable and Cost-effective Dynamic Dependence Analysis of Distributed Systems via Reinforcement Learning

X Fu, H Cai, W Li, L Li - ACM Transactions on Software Engineering and …, 2020 - dl.acm.org

… analysis. At the core of the automatic adjustment is our application of a reinforcement learning
… to the current configuration and its associated analysis cost with respect to the user budget…

被引用次数：15 相关文章所有 8 个版本

[PDF] neurips.cc

A regularized approach to sparse optimal policy in reinforcement learning

W Yang, X Li, Z Zhang - Advances in Neural Information …, 2019 - proceedings.neurips.cc

… are potentially useful in reinforcement learning. In particular, … We also conduct a full
mathematical analysis of the proposed … We empirically analyze the numerical properties of optimal …

被引用次数：30 相关文章所有 9 个版本

[PDF] neurips.cc

Online reinforcement learning in stochastic games

CY Wei, YT Hong, CJ Lu - Advances in Neural Information …, 2017 - proceedings.neurips.cc

… We study online reinforcement learning in average-reward stochastic games (SGs). An SG
models a two-player zero-sum game in a Markov environment, where state transitions and …

被引用次数：139 相关文章所有 7 个版本

[HTML] nih.gov

Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - … Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

… Abstract—Reinforcement learning is a learning … reinforcement learning upon the fast
development of deep neural networks. Along with the promising prospects of reinforcement learning …

被引用次数：521 相关文章所有 12 个版本

[PDF] arxiv.org

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer

… 2 Single-agent learning This section presents the formalism of reinforcement learning and
its main components before outlining deep reinforcement learning along with its particular …

被引用次数：624 相关文章所有 8 个版本

[PDF] nsf.gov

Evaluating critical reinforcement learning framework in the field

S Ju, G Zhou, M Abdelshiheed, T Barnes… - … conference on artificial …, 2021 - Springer

… Reinforcement Learning (RL) is learning what action to take next by mapping situations to
actions so as to maximize cumulative rewards. In recent years RL has achieved great success …

被引用次数：17 相关文章所有 5 个版本

高级搜索

QQ 群