Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes

L Mandal, C Lakshminarayanan… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we consider a 'cooperative' multi-agent Markov decision process (MDP) involving
m > 1 agents, where all agents are aware of the system model. At each decision …