Author
Sridhar Mahadevan
Publication date
2005/8/7
Book
Proceedings of the 22nd international conference on Machine learning
Pages
553-560
Description
This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis functions that form the building blocks of all value functions on a given state space manifold. Proto-value functions are learned not from rewards, but instead from analyzing the topology of the state space. Formally, proto-value functions are Fourier eigenfunctions of the Laplace-Beltrami diffusion operator on the state space manifold. Proto-value functions facilitate structural decomposition of large state spaces, and form geodesically smooth orthonormal basis functions for approximating any value function. The theoretical basis for proto-value functions combines insights from spectral graph theory, harmonic analysis, and Riemannian manifolds. Proto-value functions enable a novel generation of algorithms called representation policy iteration …
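The abstract describes proto-value functions as the smoothest eigenfunctions of the graph Laplacian constructed from the state-space topology, computed without any reward signal. A minimal sketch of that construction, assuming a small grid-world state space (the grid size and number of basis functions are illustrative choices, not taken from the paper):

```python
import numpy as np

def grid_adjacency(n):
    """Adjacency matrix of an n x n grid graph (4-neighbour connectivity),
    standing in for the topology of a grid-world state space."""
    size = n * n
    A = np.zeros((size, size))
    for r in range(n):
        for c in range(n):
            i = r * n + c
            if r + 1 < n:           # edge to the state below
                A[i, i + n] = A[i + n, i] = 1
            if c + 1 < n:           # edge to the state to the right
                A[i, i + 1] = A[i + 1, i] = 1
    return A

def proto_value_functions(A, k):
    """Return the k smoothest eigenvectors of the combinatorial graph
    Laplacian L = D - A, ordered by increasing eigenvalue. These serve
    as task-independent basis functions for value-function approximation."""
    D = np.diag(A.sum(axis=1))
    L = D - A
    eigvals, eigvecs = np.linalg.eigh(L)
    return eigvecs[:, :k]

# Basis of 4 proto-value functions on a 5x5 grid world
basis = proto_value_functions(grid_adjacency(5), 4)
```

Any value function on this state space can then be approximated as a linear combination of the columns of `basis`; the first column (eigenvalue 0) is constant on a connected graph, and later columns capture progressively finer geometric structure.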
Total citations
Scholar articles
S Mahadevan - Proceedings of the 22nd international conference on …, 2005