J Shi, X Wang, M Zhang, M Liu, J Zhu, Q Wu - Complex & Intelligent …, 2024 - Springer
Policy Gradient (PG) method is one of the most popular algorithms in Reinforcement
Learning (RL). However, distributed adaptive variants of PG are rarely studied in multi …