查看文章

researchgate.net 中的 [PDF]

Opposition-based reinforcement learning

作者

Hamid R Tizhoosh

发表日期

2006

期刊

Journal of Advanced Computational Intelligence and Intelligent Informatics

卷号

期号

简介

Reinforcement learning is a machine intelligence scheme for learning in highly dynamic, probabilistic environments. By interaction with the environment, reinforcement agents learn optimal control policies, especially in the absence of a priori knowledge and/or a sufficiently large amount of training data. Despite its advantages, however, reinforcement learning suffers from a major drawback–high calculation cost because convergence to an optimal solution usually requires that all states be visited frequently to ensure that policy is reliable. This is not always possible, however, due to the complex, high-dimensional state space in many applications. This paper introduces opposition-based reinforcement learning, inspired by opposition-based learning, to speed up convergence. Considering opposite actions simultaneously enables individual states to be updated more than once shortening exploration and expediting convergence. Three versions of Q-learning algorithm will be given as examples. Experimental results for the grid world problem of different sizes demonstrate the superior performance of the proposed approach.

引用总数

被引用次数：240

20062007200820092010201120122013201420152016201720182019202020212022202320244 14 14 13 5 8 16 13 20 19 22 19 8 12 10 16 12 7 5

学术搜索中的文章

Opposition-based reinforcement learning

HR Tizhoosh - Journal of Advanced Computational Intelligence and …, 2006

被引用次数：240 相关文章所有 4 个版本