Deep reinforcement learning for trajectory design and power allocation in UAV networks

N Zhao, Y Cheng, Y Pei, YC Liang… - ICC 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
ICC 2020-2020 IEEE International Conference on Communications (ICC), 2020 - ieeexplore.ieee.org
The unmanned aerial vehicle (UAV) is considered a key component of next-generation cellular networks. Because the joint trajectory design and power allocation problem is non-convex, it is difficult to obtain the optimal joint strategy in UAV-assisted cellular networks. This paper proposes a reinforcement learning-based approach that maximizes long-term network utility while meeting the user equipments' quality-of-service requirements. A Markov decision process (MDP) is formulated by designing the state, action space, and reward function. To obtain the joint optimal policy for trajectory design and power allocation, a deep reinforcement learning approach is investigated. Because the action space of the MDP model is continuous, a deep deterministic policy gradient (DDPG) approach is adopted. Simulation results show that the proposed algorithm outperforms other approaches in overall network utility, with higher system capacity and faster processing speed.
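The abstract does not give the concrete state, action, or reward definitions, so the following is only a minimal sketch of what such an MDP and the two standard DDPG ingredients (the bootstrapped critic target and the soft target-network update) could look like. Everything specific here is an assumption made for illustration: the class name ToyUavEnv, the single-UAV setup, the distance-based channel model, the fixed altitude, the sum-rate reward with a fixed QoS penalty, and all parameter values. None of it is taken from the paper.

```python
import math


class ToyUavEnv:
    """Hypothetical single-UAV MDP with a continuous action space (illustrative only)."""

    def __init__(self, users, p_max=1.0, step_max=5.0, noise=1e-3, rate_min=0.5):
        self.users = users        # ground-user positions [(x, y), ...] (assumed layout)
        self.p_max = p_max        # total transmit power budget (assumed)
        self.step_max = step_max  # maximum horizontal move per step (assumed)
        self.noise = noise        # noise power (assumed)
        self.rate_min = rate_min  # per-user QoS rate threshold (assumed)
        self.height = 10.0        # fixed UAV altitude (assumed)
        self.pos = [0.0, 0.0]

    def state(self):
        # State: the UAV's horizontal position.
        return tuple(self.pos)

    def step(self, action):
        # Action: (dx, dy, w_1, ..., w_K), each component clipped to [-1, 1].
        dx, dy, *weights = [max(-1.0, min(1.0, a)) for a in action]
        self.pos[0] += dx * self.step_max
        self.pos[1] += dy * self.step_max

        # Map weights to non-negative shares and scale them to the power budget.
        shares = [(w + 1.0) / 2.0 for w in weights]
        total = sum(shares) or 1.0
        powers = [self.p_max * s / total for s in shares]

        # Reward: sum rate, minus a fixed penalty for each user below the QoS threshold.
        reward = 0.0
        for (ux, uy), p in zip(self.users, powers):
            d2 = (self.pos[0] - ux) ** 2 + (self.pos[1] - uy) ** 2 + self.height ** 2
            rate = math.log2(1.0 + p / (d2 * self.noise))
            reward += rate
            if rate < self.rate_min:
                reward -= 1.0
        return self.state(), reward


def td_target(reward, next_q, gamma=0.99):
    """Bootstrapped critic target y = r + gamma * Q'(s', mu'(s'))."""
    return reward + gamma * next_q


def soft_update(target_params, online_params, tau=0.01):
    """Soft target update: theta' <- (1 - tau) * theta' + tau * theta."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target_params, online_params)]
```

In a full DDPG agent, an actor network would output the action vector passed to step, and a critic network would be regressed toward td_target on minibatches drawn from a replay buffer. Those parts are omitted here because the abstract gives no architecture details.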