J Skalse,
A Abate - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Abstract The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from
a policy pi. To do this, we need a model of how pi relates to R. In the current literature, the …