C Muslimani, ME Taylor - arXiv preprint arXiv:2405.00746, 2024 - arxiv.org
To create useful reinforcement learning (RL) agents, step zero is to design a suitable reward
function that captures the nuances of the task. However, reward engineering can be a …