A Nair, A Gupta, M Dalal, S Levine - arXiv e-prints, 2020 - ui.adsabs.harvard.edu
Reinforcement learning (RL) provides an appealing formalism for learning control policies
from experience. However, the classic active formulation of RL necessitates a lengthy active …