Is centralized training with decentralized execution framework centralized enough for MARL?

Y Zhou, S Liu, Y Qing, K Chen, T Zheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Centralized Training with Decentralized Execution (CTDE) has recently emerged as a
popular framework for cooperative Multi-Agent Reinforcement Learning (MARL), where …

Inducing stackelberg equilibrium through spatio-temporal sequential decision-making in multi-agent reinforcement learning

B Zhang, L Li, Z Xu, D Li, G Fan - arXiv preprint arXiv:2304.10351, 2023 - arxiv.org
In multi-agent reinforcement learning (MARL), self-interested agents attempt to establish
equilibrium and achieve coordination depending on game structure. However, existing …

GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

Q Hao, W Huang, T Feng, J Yuan, Y Li - Proceedings of the 29th ACM …, 2023 - dl.acm.org
Recent advancements in reinforcement learning have witnessed remarkable achievements
by intelligent agents ranging from game-playing to industrial applications. Of particular …

Towards Intelligent Mobile Crowdsensing With Task State Information Sharing over Edge-Assisted UAV Networks

L Deng, W Gong, M Liwang, L Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
With the rapid development of edge computing technology, edge-assisted unmanned aerial
vehicle (UAV) networks have become popular, helping with fast and cost-effective data …

MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification

Y Chen, J Mao, Y Zhang, D Ma, L Xia, J Fan… - arXiv preprint arXiv …, 2024 - arxiv.org
The objective of search result diversification (SRD) is to ensure that selected documents
cover as many different subtopics as possible. Existing methods primarily utilize a paradigm …

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

J Lian, Y Huang, M Wang, C Ma, Y Hao, Y Wen… - arXiv preprint arXiv …, 2024 - arxiv.org
For solving zero-sum games involving non-transitivity, a common approach is to maintain
population policies to approximate the Nash Equilibrium (NE). Previous research has shown …

Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach

B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao… - Forty-first International … - openreview.net
Asynchronous action coordination presents a pervasive challenge in Multi-Agent Systems
(MAS), which can be represented as a Stackelberg game (SG). However, the scalability of …