Fully decentralized multi-agent reinforcement learning with networked agents

Authors

Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, Tamer Basar

Publication date

2018/7/3

Conference

International Conference on Machine Learning

Pages

5872-5881

Publisher

PMLR

Description

We consider the fully decentralized multi-agent reinforcement learning (MARL) problem, where the agents are connected via a time-varying and possibly sparse communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. To maximize the globally averaged return over the network, we propose two fully decentralized actor-critic algorithms, which are applicable to large-scale MARL problems in an online fashion. Convergence guarantees are provided when the value functions are approximated within the class of linear functions. Our work appears to be the first theoretical study of fully decentralized MARL algorithms for networked agents that use function approximation.

Total citations

Cited by 646

Citations per year: 2018: 19, 2019: 52, 2020: 93, 2021: 131, 2022: 140, 2023: 143, 2024: 67