查看文章

academia.edu 中的 [PDF]

Reinforcement learning based on actions and opposite actions

作者

Hamid R Tizhoosh

发表日期

2005/12/19

期刊

International conference on artificial intelligence and machine learning

卷号

414

简介

Reinforcement learning is a machine intelligence scheme for learning in highly dynamic and probabilistic environments. The methodology, however, suffers from a major drawback; the convergence to an optimal solution usually requires high computational expense since all states should be visited frequently in order to guarantee a reliable policy. In this paper, a new reinforcement learning algorithm is introduced to achieve a faster convergence by taking into account the opposite actions. By considering the opposite actions simultaneously multiple updates can be made for each state observation. This leads to a shorter exploration period and, hence, expedites the convergence. Experimental results for the grid world problem of different sizes are provided to verify the performance of the proposed approach.

引用总数

被引用次数：163

20062007200820092010201120122013201420152016201720182019202020212022202320246 11 10 9 5 5 7 8 12 12 16 10 7 9 9 5 9 8 4

学术搜索中的文章

Reinforcement learning based on actions and opposite actions

HR Tizhoosh - International conference on artificial intelligence and …, 2005

被引用次数：163 相关文章