作者
Amy McGovern, Richard S Sutton, Andrew H Fagg
发表日期
1997/9
期刊
Grace Hopper celebration of women in computing
卷号
1317
页码范围
15
简介
We analyze the use of built-in policies, or macro-actions, as a form of domain knowledge that can improve the speed and scaling of reinforcement learning algorithms. Such macro-actions are often used in robotics, and macro-operators are also well-known as an aid to state-space search in AI systems. The macro-actions we consider are closed-loop policies with termi-nation conditions. The macro-actions can be chosen at the same level as primitive actions. Macro-actions commit the learning agent to act in a particular, purposeful way for a sustained period of time. Overall, macro-actions may either accelerate or retard learn-ing, depending on the appropriateness of the macro-actions to the particular task. We analyze their effect in a simple example, breaking the acceleration effect into two parts: 1) the effect of the macro-action in changing exploratory behavior, independent of learning, and 2) the effect of the macro …
引用总数
199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202421252692455144333573775943572
学术搜索中的文章
A McGovern, RS Sutton, AH Fagg - Grace Hopper celebration of women in computing, 1997