D Qiao,
YX Wang - Advances in Neural Information …, 2024 - proceedings.neurips.cc
The offline reinforcement learning (RL) problem is often motivated by the need to learn data-
driven decision policies in financial, legal and healthcare applications. However, the learned …