M Peschl, A Zgonnikov, FA Oliehoek… - AAMAS 2022: 21st …, 2022 - research.tudelft.nl
Inferring reward functions from demonstrations and pairwise preferences are auspicious
approaches for aligning Reinforcement Learning (RL) agents with human intentions …