作者
Khoi Khac Nguyen, Trung Q Duong, Ngo Anh Vien, Nhien-An Le-Khac, Long D Nguyen
发表日期
2019/11/8
期刊
IEEE Access
卷号
7
页码范围
164533-164543
出版商
IEEE
简介
Device-to-device (D2D) communication is an emerging technology in the evolution of the 5G network enabled vehicle-to-vehicle (V2V) communications. It is a core technique for the next generation of many platforms and applications, e.g. real-time high-quality video streaming, virtual reality game, and smart city operation. However, the rapid proliferation of user devices and sensors leads to the need for more efficient resource allocation algorithms to enhance network performance while still capable of guaranteeing the quality-of-service. Currently, deep reinforcement learning is rising as a powerful tool to enable each node in the network to have a real-time self-organising ability. In this paper, we present two novel approaches based on deep deterministic policy gradient algorithm, namely “distributed deep deterministic policy gradient” and “sharing deep deterministic policy gradient”, for the multi-agent power …
引用总数
2020202120222023202411131894