查看文章

icml.cc 中的 [PDF]

Dynamic abstraction in reinforcement learning via clustering

作者

Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein

发表日期

2004/7/4

研讨会论文

Proceedings of the twenty-first international conference on Machine learning

页码范围

出版商

ACM

简介

We consider a graph theoretic approach for automatic construction of options in a dynamic environment. A map of the environment is generated on-line by the learning agent, representing the topological structure of the state transitions. A clustering algorithm is then used to partition the state space to different regions. Policies for reaching the different parts of the space are separately learned and added to the model in a form of options (macro-actions). The options are used for accelerating the Q-Learning algorithm. We extend the basic algorithm and consider building a map that includes preliminary indication of the location of "interesting" regions of the state space, where the value gradient is significant and additional exploration might be beneficial. Experiments indicate significant speedups, especially in the initial learning phase.

引用总数

被引用次数：329

2004200520062007200820092010201120122013201420152016201720182019202020212022202320248 15 8 16 16 11 14 17 7 18 19 13 18 16 18 26 22 15 15 28 8

学术搜索中的文章

Dynamic abstraction in reinforcement learning via clustering

S Mannor, I Menache, A Hoze, U Klein - Proceedings of the twenty-first international conference …, 2004

被引用次数：329 相关文章所有 10 个版本