作者
Mohit K Sharma, Alessio Zappone, Mohamad Assaad, Mérouane Debbah, Spyridon Vassilaras
发表日期
2019/10/25
期刊
IEEE Transactions on Cognitive Communications and Networking
卷号
5
期号
4
页码范围
1140-1154
出版商
IEEE
简介
In this paper, we develop a multi-agent reinforcement learning (MARL) framework to obtain online power control policies for a large energy harvesting (EH) multiple access channel, when only causal information about the EH process and wireless channel is available. In the proposed framework, we model the online power control problem as a discrete-time mean-field game (MFG), and analytically show that the MFG has a unique stationary solution. Next, we leverage the fictitious play property of the mean-field games, and the deep reinforcement learning technique to learn the stationary solution of the game, in a completely distributed fashion. We analytically show that the proposed procedure converges to the unique stationary solution of the MFG. This, in turn, ensures that the optimal policies can be learned in a completely distributed fashion. In order to benchmark the performance of the distributed policies, we …
引用总数
20192020202120222023202448151293
学术搜索中的文章
MK Sharma, A Zappone, M Assaad, M Debbah… - IEEE Transactions on Cognitive Communications and …, 2019