M Andrychowicz, A Raichuk, P Stańczyk… - ICLR 2021-Ninth …, 2021 - inria.hal.science
In recent years, on-policy reinforcement learning (RL) has been successfully applied to
many different continuous control tasks. While RL algorithms are often conceptually simple …