This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
S MAHADEVAN - Machine Learning, 1996 - people.cs.umass.edu
This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
S Mahadevan - Machine Learning, 1996 - elibrary.ru
This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
Average reward reinforcement learning: foundations, algorithms, and empirical results: Machine Language: Vol 22, No 1-3 ACM Digital Library home ACM home Google, Inc. (search) …
This paper presents a detailed study of average reward reinforcement learning, an undiscounted optimality framework that is more appropriate for cyclical tasks than the much …
[引用][C]Average reward reinforcement learning: Foundations, algorithms, and empirical results