J Skalse, A Abate - Proceedings of the Thirty-Seventh AAAI Conference …, 2023 - dl.acm.org
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy
π. To do this, we need a model of how π relates to R. In the current literature, the most …