HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach

S Lian, Y Ma, J Liu, Y Zheng, Z Meng - arXiv preprint arXiv:2306.06329, 2023 - arxiv.org
Offline reinforcement learning (ORL) has gained attention as a means of training
reinforcement learning models using pre-collected static data. To address the issue of …