K De Asis, JF Hernandez-Garcia, GZ Holland… - arXiv preprint arXiv …, 2017 - arxiv.org
Unifying seemingly disparate algorithmic ideas to produce better performing algorithms has
been a longstanding goal in reinforcement learning. As a primary example, TD ($\lambda $) …