G Antonov, P Dayan - Reinforcement Learning Conference (RLC …, 2024 - rlj.cs.umass.edu
Epistemic uncertainty, which stems from what a learning algorithm does not know, is the
natural signal for exploration. Capturing and exploiting epistemic uncertainty for efficient …