[PDF][PDF] Offline reinforcement learning in large state spaces: Algorithms and guarantees

N Jiang, T Xie - Statistical Science, 2024 - nanjiang.cs.illinois.edu
This article introduces the theory of offline reinforcement learning in large state spaces,
where good policies are learned from historical data without online interactions with the …

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

B Chan, A Leung, J Bergstra - arXiv preprint arXiv:2410.14957, 2024 - arxiv.org
Offline-to-online reinforcement learning (O2O RL) aims to obtain a continually improving
policy as it interacts with the environment, while ensuring the initial policy behaviour is …