Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

[PDF][PDF] Reinforcement learning in stationary mean-field games

J Subramanian, A Mahajan - … of the 18th International Conference on …, 2019 - cim.mcgill.ca
Multi-agent reinforcement learning (MARL) refers to systems in which multiple agents are
acting in a common and unknown environment. The presence of other agents makes MARL …

Approximate information state for approximate planning and reinforcement learning in partially observed systems

J Subramanian, A Sinha, R Seraj, A Mahajan - Journal of Machine …, 2022 - jmlr.org
We propose a theoretical framework for approximate planning and learning in partially
observed systems. Our framework is based on the fundamental notion of information state …

Mean-field-type games in engineering

B Djehiche, A Tcheukam, H Tembine - arXiv preprint arXiv:1605.03281, 2016 - arxiv.org
A mean-field-type game is a game in which the instantaneous payoffs and/or the state
dynamics functions involve not only the state and the action profile but also the joint …

On team decision problems with nonclassical information structures

AA Malikopoulos - IEEE Transactions on Automatic Control, 2022 - ieeexplore.ieee.org
In this article, we consider sequential dynamic team decision problems with nonclassical
information structures. First, we address the problem from the point of view of a “manager” …

Team-optimal solution of finite number of mean-field coupled LQG subsystems

J Arabneydi, A Mahajan - 2015 54th IEEE conference on …, 2015 - ieeexplore.ieee.org
A decentralized control system with linear dynamics, quadratic cost, and Gaussian
disturbances is considered. The system consists of a finite number of subsystems whose …

Zero-Sum Games between Mean-Field Teams: Reachability-Based Analysis under Mean-Field Sharing

Y Guan, M Afshari, P Tsiotras - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
This work studies the behaviors of two large-population teams competing in a discrete
environment. The team-level interactions are modeled as a zero-sum game while the agent …

Abstracting imperfect information away from two-player zero-sum games

S Sokota, R D'Orazio, CK Ling, DJ Wu… - International …, 2023 - proceedings.mlr.press
In their seminal work, Nayyar et al.(2013) showed that imperfect information can be
abstracted away from common-payoff games by having players publicly announce their …

Solving common-payoff games with approximate policy iteration

S Sokota, E Lockhart, F Timbers, E Davoodi… - Proceedings of the …, 2021 - ojs.aaai.org
For artificially intelligent learning systems to have widespread applicability in real-world
settings, it is important that they be able to operate decentrally. Unfortunately, decentralized …

Deep teams: Decentralized decision making with finite and infinite number of agents

J Arabneydi, AG Aghdam - IEEE Transactions on Automatic …, 2020 - ieeexplore.ieee.org
Inspired by the concepts of deep learning in artificial intelligence and fairness in behavioral
economics, we introduce deep teams in this article. In such systems, agents are partitioned …