Learning and Planning with the Average-Reward Formulation

Y Wan - 2023 - era.library.ualberta.ca
The average-reward formulation is a natural and important formulation of learning and
planning problems, yet has received much less attention than the episodic and discounted …

Continual Meta Learning

A Sharifnassab - openmindresearch.org
Background: The origins of meta step-size optimization date back to seminal works such as
Kesten's accelerated procedure (Kesten, 1958) and Incremental Delta-bar-Delta …