Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

A review of cooperation in multi-agent learning

Y Du, JZ Leibo, U Islam, R Willis, P Sunehag - arXiv preprint arXiv …, 2023 - arxiv.org
Cooperation in multi-agent learning (MAL) is a topic at the intersection of numerous
disciplines, including game theory, economics, social sciences, and evolutionary biology …

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning

T Zhang, Y Li, C Wang, G Xie… - … conference on machine …, 2021 - proceedings.mlr.press
Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …

Social diversity and social preferences in mixed-motive reinforcement learning

KR McKee, I Gemp, B McWilliams… - arXiv preprint arXiv …, 2020 - arxiv.org
Recent research on reinforcement learning in pure-conflict and pure-common interest
games has emphasized the importance of population heterogeneity. In contrast, studies of …

Learning to incentivize other learning agents

J Yang, A Li, M Farajtabar, P Sunehag… - Advances in …, 2020 - proceedings.neurips.cc
The challenge of developing powerful and general Reinforcement Learning (RL) agents has
received increasing attention in recent years. Much of this effort has focused on the single …

Intrinsic fluctuations of reinforcement learning promote cooperation

W Barfuss, JM Meylahn - Scientific reports, 2023 - nature.com
In this work, we ask for and answer what makes classical temporal-difference reinforcement
learning with ϵ-greedy strategies cooperative. Cooperating in social dilemma situations is …

Emergent bartering behaviour in multi-agent reinforcement learning

MB Johanson, E Hughes, F Timbers… - arXiv preprint arXiv …, 2022 - arxiv.org
Advances in artificial intelligence often stem from the development of new environments that
abstract real-world situations into a form where research can be done conveniently. This …

Learning to play no-press diplomacy with best response policy iteration

T Anthony, T Eccles, A Tacchetti… - Advances in …, 2020 - proceedings.neurips.cc
Recent advances in deep reinforcement learning (RL) have led to considerable progress in
many 2-player zero-sum games, such as Go, Poker and Starcraft. The purely adversarial …

A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings

E Vinitsky, R Köster, JP Agapiou… - Collective …, 2023 - journals.sagepub.com
Society is characterized by the presence of a variety of social norms: collective patterns of
sanctioning that can prevent miscoordination and free-riding. Inspired by this, we aim to …

How and why to manipulate your own agent: On the incentives of users of learning agents

Y Kolumbus, N Nisan - Advances in Neural Information …, 2022 - proceedings.neurips.cc
The usage of automated learning agents is becoming increasingly prevalent in many online
economic applications such as online auctions and automated trading. Motivated by such …