C Xu, R Zhu, D Yang - 2021 International Conference on …, 2021 - ieeexplore.ieee.org
Proximal Policy Optimization (PPO) is a classical algorithm in reinforcement learning, which
has been tested in a collection of benchmark tasks. In this paper, we test PPO in Unity …