Policy mirror ascent for efficient and independent learning in mean field games

M Laurière, S Perrin, M Geist… - arXiv preprint arXiv …, 2022 - researchgate.net

Non-cooperative and cooperative games with a very large number of players have many
applications but remain generally intractable when the number of players increases …

被引用次数：56 相关文章所有 2 个版本

[PDF] neurips.cc

A novel framework for policy mirror descent with general parameterization and linear convergence

C Alfano, R Yuan, P Rebeschini - Advances in Neural …, 2023 - proceedings.neurips.cc

Modern policy optimization methods in reinforcement learning, such as TRPO and PPO, owe
their success to the use of parameterized policies. However, while theoretical guarantees …

被引用次数：20 相关文章所有 8 个版本

[PDF] mlr.press

Oracle-free reinforcement learning in mean-field games along a single sample path

MAU Zaman, A Koppel, S Bhatt… - … Conference on Artificial …, 2023 - proceedings.mlr.press

We consider online reinforcement learning in Mean-Field Games (MFGs). Unlike traditional
approaches, we alleviate the need for a mean-field oracle by developing an algorithm that …

被引用次数：22 相关文章所有 5 个版本

[PDF] neurips.cc

Learning regularized monotone graphon mean-field games

F Zhang, V Tan, Z Wang… - Advances in Neural …, 2023 - proceedings.neurips.cc

This paper studies two fundamental problems in regularized Graphon Mean-Field Games
(GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $\lambda …

被引用次数：8 相关文章所有 5 个版本

[PDF] neurips.cc

On imitation in mean-field games

G Ramponi, P Kolev, O Pietquin, N He… - Advances in …, 2024 - proceedings.neurips.cc

We explore the problem of imitation learning (IL) in the context of mean-field games (MFGs),
where the goal is to imitate the behavior of a population of agents following a Nash …

被引用次数：2 相关文章所有 8 个版本

[PDF] arxiv.org

When is Mean-Field Reinforcement Learning Tractable and Relevant?

B Yardim, A Goldman, N He - arXiv preprint arXiv:2402.05757, 2024 - arxiv.org

Mean-field reinforcement learning has become a popular theoretical framework for efficiently
approximating large-scale multi-agent reinforcement learning (MARL) problems exhibiting …

被引用次数：7 相关文章所有 6 个版本

[PDF] mlr.press

On the statistical efficiency of mean-field reinforcement learning with general function approximation

J Huang, B Yardim, N He - International Conference on …, 2024 - proceedings.mlr.press

In this paper, we study the fundamental statistical efficiency of Reinforcement Learning in
Mean-Field Control (MFC) and Mean-Field Game (MFG) with general model-based function …

被引用次数：12 相关文章所有 4 个版本

[PDF] researchgate.net

[PDF][PDF] Major-Minor Mean Field Multi-Agent Reinforcement Learning

K Cui, C Fabian, A Tahir, H Koeppl - arXiv preprint arXiv …, 2023 - researchgate.net

Recently, mean field control (MFC) has provided a tractable and theoretically founded
approach to otherwise difficult cooperative multi-agent control. However, the strict …

被引用次数：9 相关文章所有 2 个版本

[PDF] aaai.org

Learning Discrete-Time Major-Minor Mean Field Games

K Cui, G Dayanıklı, M Laurière, M Geist… - Proceedings of the …, 2024 - ojs.aaai.org

Recent techniques based on Mean Field Games (MFGs) allow the scalable analysis of multi-
player games with many similar, rational agents. However, standard MFGs remain limited to …

被引用次数：5 相关文章所有 4 个版本

[PDF] mlr.press

Robust cooperative multi-agent reinforcement learning: A mean-field type game perspective

MAU Zaman, M Lauriere, A Koppel… - 6th Annual Learning …, 2024 - proceedings.mlr.press

In this paper, we study the problem of robust cooperative multi-agent reinforcement learning
(RL) where a large number of cooperative agents with distributed information aim to learn …

被引用次数：1 相关文章

高级搜索

QQ 群