Regret minimization and convergence to equilibria in general-sum markov games

L Erez, T Lancewicki, U Sherman… - International …, 2023 - proceedings.mlr.press
An abundance of recent impossibility results establish that regret minimization in Markov
games with adversarial opponents is both statistically and computationally intractable …

A unified stochastic approximation framework for learning in games

P Mertikopoulos, YP Hsieh, V Cevher - Mathematical Programming, 2024 - Springer
We develop a flexible stochastic approximation framework for analyzing the long-run
behavior of learning in games (both continuous and finite). The proposed analysis template …

Optimistic policy gradient in multi-player markov games with a single controller: Convergence beyond the minty property

I Anagnostides, I Panageas, G Farina… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Policy gradient methods enjoy strong practical performance in numerous tasks in
reinforcement learning. Their theoretical understanding in multiagent settings, however …

Independent Learning in Constrained Markov Potential Games

P Jordan, A Barakat, N He - International Conference on …, 2024 - proceedings.mlr.press
Constrained Markov games offer a formal mathematical framework for modeling multi-agent
reinforcement learning problems where the behavior of the agents is subject to constraints …

Learning to steer markovian agents under model uncertainty

J Huang, V Thoma, Z Shen, HH Nax, N He - arXiv preprint arXiv …, 2024 - arxiv.org
Designing incentives for an adapting population is a ubiquitous problem in a wide array of
economic applications and beyond. In this work, we study how to design additional rewards …

Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games

J Dong, B Wang, Y Yu - International Conference on Artificial …, 2024 - proceedings.mlr.press
In this work, we study potential games and Markov potential games under stochastic cost
and bandit feedback. We propose a variant of the Frank-Wolfe algorithm with sufficient …

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey

J Jiang, K Su, Z Lu - arXiv preprint arXiv:2401.04934, 2024 - arxiv.org
Cooperative multi-agent reinforcement learning is a powerful tool to solve many real-world
cooperative tasks, but restrictions of real-world applications may require training the agents …

Networked policy gradient play in markov potential games

S Aydın, C Eksin - … 2023-2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org
We propose a networked policy gradient play algorithm for solving Markov potential games.
In a Markov game, each agent has a reward function that depends on the actions of all the …

Decision-Making in multi-agent systems: delays, adaptivity, and learning in games

YG Hsieh - 2023 - theses.hal.science
With the increasing deployment of decision-making and learning algorithms in multi-agent
systems, it becomes imperative to understand their efficiency and improve their …

Almost Sure Convergence of Networked Policy Gradient over Time-Varying Networks in Markov Potential Games

S Aydin, C Eksin - arXiv preprint arXiv:2410.20075, 2024 - arxiv.org
We propose networked policy gradient play for solving Markov potential games including
continuous action and state spaces. In the decentralized algorithm, agents sample their …