Z Peng, C Han, Y Liu, Z Zhou - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
Offline reinforcement learning (RL) aims to learn policy from the passively collected offline
dataset. Applying existing RL methods on the static dataset straightforwardly will raise …