作者
Frank L Lewis, Draguna Vrabie
发表日期
2009/8/28
期刊
IEEE circuits and systems magazine
卷号
9
期号
3
页码范围
32-50
出版商
IEEE
简介
Living organisms learn by acting on their environment, observing the resulting reward stimulus, and adjusting their actions accordingly to improve the reward. This action-based or reinforcement learning can capture notions of optimal behavior occurring in natural systems. We describe mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming. These give us insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
引用总数
201020112012201320142015201620172018201920202021202220232024132355589876799711616316115215717599
学术搜索中的文章