作者
Frank L Lewis, Draguna Vrabie
发表日期
2009/8/28
期刊
IEEE circuits and systems magazine
卷号
9
期号
3
页码范围
32-50
出版商
IEEE
简介
Living organisms learn by acting on their environment, observing the resulting reward stimulus, and adjusting their actions accordingly to improve the reward. This action-based or reinforcement learning can capture notions of optimal behavior occurring in natural systems. We describe mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming. These give us insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
引用总数
学术搜索中的文章
FL Lewis, D Vrabie - IEEE circuits and systems magazine, 2009