Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

A review of cooperative multi-agent deep reinforcement learning

A Oroojlooy, D Hajinezhad - Applied Intelligence, 2023 - Springer
Abstract Deep Reinforcement Learning has made significant progress in multi-agent
systems in recent years. The aim of this review article is to provide an overview of recent …

Better exploration with optimistic actor critic

K Ciosek, Q Vuong, R Loftin… - Advances in Neural …, 2019 - proceedings.neurips.cc
Actor-critic methods, a type of model-free Reinforcement Learning, have been successfully
applied to challenging tasks in continuous control, often achieving state-of-the art …

Settling the variance of multi-agent policy gradients

JG Kuba, M Wen, L Meng, H Zhang… - Advances in …, 2021 - proceedings.neurips.cc
Policy gradient (PG) methods are popular reinforcement learning (RL) methods where a
baseline is often applied to reduce the variance of gradient estimates. In multi-agent RL …

A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

Decentralized multi-agent reinforcement learning with networked agents: Recent advances

K Zhang, Z Yang, T Başar - Frontiers of Information Technology & …, 2021 - Springer
Multi-agent reinforcement learning (MARL) has long been a significant research topic in
both machine learning and control systems. Recent development of (single-agent) deep …

Networked multi-agent reinforcement learning in continuous spaces

K Zhang, Z Yang, T Basar - 2018 IEEE conference on decision …, 2018 - ieeexplore.ieee.org
Many real-world tasks on practical control systems involve the learning and decision-making
of multiple agents, under limited communications and observations. In this paper, we study …

FedKL: Tackling data heterogeneity in federated reinforcement learning by penalizing KL divergence

Z Xie, S Song - IEEE Journal on Selected Areas in …, 2023 - ieeexplore.ieee.org
One of the fundamental issues for Federated Learning (FL) is data heterogeneity, which
causes accuracy degradation, slow convergence, and the communication bottleneck issue …

Deep deterministic policy gradient with compatible critic network

D Wang, M Hu - IEEE Transactions on Neural Networks and …, 2021 - ieeexplore.ieee.org
Deep deterministic policy gradient (DDPG) is a powerful reinforcement learning algorithm for
large-scale continuous controls. DDPG runs the back-propagation from the state-action …

A reinforcement learning approach to rare trajectory sampling

DC Rose, JF Mair, JP Garrahan - New Journal of Physics, 2021 - iopscience.iop.org
Very often when studying non-equilibrium systems one is interested in analysing dynamical
behaviour that occurs with very low probability, so called rare events. In practice, since rare …