Expected policy gradients for reinforcement learning

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1668 相关文章所有 8 个版本

[PDF] arxiv.org

A review of cooperative multi-agent deep reinforcement learning

A Oroojlooy, D Hajinezhad - Applied Intelligence, 2023 - Springer

Abstract Deep Reinforcement Learning has made significant progress in multi-agent
systems in recent years. The aim of this review article is to provide an overview of recent …

被引用次数：497 相关文章所有 8 个版本

[PDF] neurips.cc

Better exploration with optimistic actor critic

K Ciosek, Q Vuong, R Loftin… - Advances in Neural …, 2019 - proceedings.neurips.cc

Actor-critic methods, a type of model-free Reinforcement Learning, have been successfully
applied to challenging tasks in continuous control, often achieving state-of-the art …

被引用次数：181 相关文章所有 9 个版本

[PDF] neurips.cc

Settling the variance of multi-agent policy gradients

JG Kuba, M Wen, L Meng, H Zhang… - Advances in …, 2021 - proceedings.neurips.cc

Policy gradient (PG) methods are popular reinforcement learning (RL) methods where a
baseline is often applied to reduce the variance of gradient estimates. In multi-agent RL …

被引用次数：66 相关文章所有 10 个版本

[PDF] ieee.org

A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org

Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

被引用次数：202 相关文章所有 17 个版本

[PDF] arxiv.org

Decentralized multi-agent reinforcement learning with networked agents: Recent advances

K Zhang, Z Yang, T Başar - Frontiers of Information Technology & …, 2021 - Springer

Multi-agent reinforcement learning (MARL) has long been a significant research topic in
both machine learning and control systems. Recent development of (single-agent) deep …

被引用次数：101 相关文章所有 9 个版本

Networked multi-agent reinforcement learning in continuous spaces

K Zhang, Z Yang, T Basar - 2018 IEEE conference on decision …, 2018 - ieeexplore.ieee.org

Many real-world tasks on practical control systems involve the learning and decision-making
of multiple agents, under limited communications and observations. In this paper, we study …

被引用次数：123 相关文章所有 4 个版本

[PDF] arxiv.org

FedKL: Tackling data heterogeneity in federated reinforcement learning by penalizing KL divergence

Z Xie, S Song - IEEE Journal on Selected Areas in …, 2023 - ieeexplore.ieee.org

One of the fundamental issues for Federated Learning (FL) is data heterogeneity, which
causes accuracy degradation, slow convergence, and the communication bottleneck issue …

被引用次数：49 相关文章所有 4 个版本

Deep deterministic policy gradient with compatible critic network

D Wang, M Hu - IEEE Transactions on Neural Networks and …, 2021 - ieeexplore.ieee.org

Deep deterministic policy gradient (DDPG) is a powerful reinforcement learning algorithm for
large-scale continuous controls. DDPG runs the back-propagation from the state-action …

被引用次数：45 相关文章所有 3 个版本

[PDF] iop.org Full View

A reinforcement learning approach to rare trajectory sampling

DC Rose, JF Mair, JP Garrahan - New Journal of Physics, 2021 - iopscience.iop.org

Very often when studying non-equilibrium systems one is interested in analysing dynamical
behaviour that occurs with very low probability, so called rare events. In practice, since rare …

被引用次数：66 相关文章所有 6 个版本

高级搜索

QQ 群