The Best of Both Worlds in Network Population Games: Reaching Consensus and Convergence to Equilibrium

S Hu, H Soh, G Piliouras - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Reaching consensus and convergence to equilibrium are two major challenges of multi-
agent systems. Although each has attracted significant attention, relatively few studies …

Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory

S Leonardos, G Piliouras - Artificial Intelligence, 2022 - Elsevier
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL);
however, its effects are far from understood. To make progress in this direction, we study a …

The dynamics of q-learning in population games: A physics-inspired continuity equation model

S Hu, CW Leung, H Leung, H Soh - arXiv preprint arXiv:2203.01500, 2022 - arxiv.org
Although learning has found wide application in multi-agent systems, its effects on the
temporal evolution of a system are far from understood. This paper focuses on the dynamics …

Routing games in the wild: Efficiency, equilibration, regret, and a price of anarchy bound via long division

B Monnot, F Benita, G Piliouras - ACM Transactions on Economics and …, 2022 - dl.acm.org
Routing games are amongst the most well studied domains of game theory. How relevant
are these pen-and-paper calculations to understanding the reality of everyday traffic …

Scalable nested optimization for deep learning

JP Lorraine - 2024 - search.proquest.com
Gradient-based optimization has been critical to the success of machine learning, updating
a single set of parameters to minimize a single loss. A growing number of applications rely …

Lyapunov exponents for diversity in differentiable games

J Lorraine, P Vicol, J Parker-Holder, T Kachman… - arXiv preprint arXiv …, 2021 - arxiv.org
Ridge Rider (RR) is an algorithm for finding diverse solutions to optimization problems by
following eigenvectors of the Hessian (" ridges"). RR is designed for conservative gradient …

The impact of exploration on convergence and performance of multi-agent Q-learning dynamics

A Hussain, F Belardinelli… - … Conference on Machine …, 2023 - proceedings.mlr.press
Understanding the impact of exploration on the behaviour of multi-agent learning has, so far,
benefited from the restriction to potential, or network zero-sum games in which convergence …

Catastrophe by design in population games: a mechanism to destabilize inefficient locked-in technologies

S Leonardos, J Sakos, C Courcoubetis… - ACM Transactions on …, 2023 - dl.acm.org
In multi-agent environments in which coordination is desirable, the history of play often
causes lock-in at sub-optimal outcomes. Notoriously, technologies with significant …

An Analysis of Logit Learning with the r-Lambert Function

R Gavin, M Cao, K Paarporn - arXiv preprint arXiv:2409.05044, 2024 - arxiv.org
The well-known replicator equation in evolutionary game theory describes how population-
level behaviors change over time when individuals make decisions using simple imitation …

Formal Modeling of Reinforcement Learning with Many Agents through Repeated Local Interactions

CW Leung, S Hu, HF Leung - 2021 IEEE 33rd International …, 2021 - ieeexplore.ieee.org
Modelling the dynamics of multi-agent reinforcement learning has long been an important
research topic. Most of the previous works focus on agents learning under global …