[HTML][HTML] Multi-agent reinforcement learning: A review of challenges and applications

L Canese, GC Cardarilli, L Di Nunzio, R Fazzolari… - Applied Sciences, 2021 - mdpi.com
In this review, we present an analysis of the most used multi-agent reinforcement learning
algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the …

Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

The surprising effectiveness of ppo in cooperative multi-agent games

C Yu, A Velu, E Vinitsky, J Gao… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …

[HTML][HTML] Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Multi-agent reinforcement learning is a sequence modeling problem

M Wen, J Kuba, R Lin, W Zhang… - Advances in …, 2022 - proceedings.neurips.cc
Large sequence models (SM) such as GPT series and BERT have displayed outstanding
performance and generalization capabilities in natural language process, vision and …

Qplex: Duplex dueling multi-agent q-learning

J Wang, Z Ren, T Liu, Y Yu, C Zhang - arXiv preprint arXiv:2008.01062, 2020 - arxiv.org
We explore value-based multi-agent reinforcement learning (MARL) in the popular
paradigm of centralized training with decentralized execution (CTDE). CTDE has an …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Is independent learning all you need in the starcraft multi-agent challenge?

CS De Witt, T Gupta, D Makoviichuk… - arXiv preprint arXiv …, 2020 - arxiv.org
Most recently developed approaches to cooperative multi-agent reinforcement learning in
the\emph {centralized training with decentralized execution} setting involve estimating a …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Towards autonomous multi-UAV wireless network: A survey of reinforcement learning-based approaches

Y Bai, H Zhao, X Zhang, Z Chang… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
Unmanned aerial vehicle (UAV)-based wireless networks have received increasing
research interest in recent years and are gradually being utilized in various aspects of our …