Learning and Planning with the Average-Reward Formulation

Y Wan - 2023 - era.library.ualberta.ca
The average-reward formulation is a natural and important formulation of learning and
planning problems, yet has received much less attention than the episodic and discounted …