R Yang, C Bai, X Ma, Z Wang, C Zhang… - Proceedings of the 36th …, 2022 - dl.acm.org
Offline reinforcement learning (RL) provides a promising direction to exploit massive amounts
of offline data for complex decision-making tasks. Due to the distribution shift issue, current …