S Hu, L Shen, Y Zhang, D Tao - arXiv preprint arXiv:2303.03747, 2023 - arxiv.org
Offline reinforcement learning (RL) is a challenging task, whose objective is to learn policies
from static trajectory data without interacting with the environment. Recently, offline RL has …