Z Huang, S Sun, J Zhao - Knowledge-Based Systems, 2024 - Elsevier
Offline reinforcement learning (RL) aims to learn a policy from pre-collected data, avoiding
costly or risky interactions with the environment. In the offline setting, the inherent problem of …