K Li, A Gupta, A Reddy, V Pong, A Zhou, J Yu… - arXiv preprint arXiv …, 2021 - arxiv.org
Exploration in reinforcement learning is a challenging problem: in the worst case, the agent
must search for high-reward states that could be hidden anywhere in the state space. Can …