G Li, L Shi, Y Chen, Y Chi, Y Wei - arXiv preprint arXiv:2204.05275, 2022 - arxiv.org
This paper is concerned with offline reinforcement learning (RL), which learns using pre-
collected data without further exploration. Effective offline RL would be able to accommodate …