V Boone,
B Gaujal - International Conference on Artificial …, 2023 - proceedings.mlr.press
This paper investigates a new learning problem, the identification of Blackwell optimal
policies on deterministic MDPs (DMDPs): A learner has to return a Blackwell optimal policy …