E Chalmers, EB Contreras, B Robertson… - IEEE Transactions on …, 2017 - europepmc.org
The reinforcement learning (RL) paradigm allows agents to solve tasks through trial-and-error learning.
To be capable of efficient, long-term learning, RL agents should be able to …