X Chen, A Ghadirzadeh, T Yu, J Wang, Y Gao… - Proceedings of the 36th …, 2022 - dl.acm.org
Offline reinforcement learning methods hold the promise of learning policies from pre-
collected datasets without the need to query the environment for new samples. This setting …