Z Zhang, S Du, X Ji - International Conference on Machine …, 2021 - proceedings.mlr.press
We study the reward-free reinforcement learning framework, which is particularly suitable for
batch reinforcement learning and scenarios where one needs policies for multiple reward …