Reward Machines for Deep RL in Noisy and Uncertain Environments

AC Li, Z Chen, TQ Klassen, P Vaezipoor… - arXiv preprint arXiv …, 2024 - arxiv.org
Reward Machines provide an automata-inspired structure for specifying instructions, safety
constraints, and other temporally extended reward-worthy behaviour. By exposing complex …