查看文章

umass.edu 中的 [PDF]

Recent advances in hierarchical reinforcement learning

作者

Andrew G Barto, Sridhar Mahadevan

发表日期

2003/10

来源

Discrete event dynamic systems

卷号

页码范围

341-379

出版商

Kluwer Academic Publishers

简介

Reinforcement learning is bedeviled by the curse of dimensionality: the number of parameters to be learned grows exponentially with the size of any compact encoding of a state. Recent attempts to combat the curse of dimensionality have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step, but rather invoke the execution of temporally-extended activities which follow their own policies until termination. This leads naturally to hierarchical control architectures and associated learning algorithms. We review several approaches to temporal abstraction and hierarchical organization that machine learning researchers have recently developed. Common to these approaches is a reliance on the theory of semi-Markov decision processes, which we emphasize in our review. We then discuss extensions of these ideas to concurrent activities, multiagent …

引用总数

被引用次数：1745

200320042005200620072008200920102011201220132014201520162017201820192020202120222023202415 38 64 74 75 79 52 65 53 80 70 61 53 76 67 90 115 106 131 126 162 78

学术搜索中的文章

Recent advances in hierarchical reinforcement learning

AG Barto, S Mahadevan - Discrete event dynamic systems, 2003

被引用次数：1745 相关文章所有 23 个版本