FM Luo, X Cao, RJ Qin, Y Yu - arXiv preprint arXiv:2206.00238, 2022 - arxiv.org
Recovering a reward function from expert demonstrations is a fundamental problem in
reinforcement learning. The recovered reward function captures the motivation of the expert …