D Sadigh, AD Dragan, S Sastry, SA Seshia - m.roboticsproceedings.org
Our goal is to efficiently learn reward functions encoding a human's preferences for how a
dynamical system should act. There are two challenges with this. First, in many problems it is …