Authors
Amy McGovern, Andrew G Barto
Publication date
2001
Description
This paper presents a method by which a reinforcement learning agent can automatically discover certain types of subgoals online. By creating useful new subgoals while learning, the agent is able to accelerate learning on the current task and to transfer its expertise to other, related tasks through the reuse of its ability to attain subgoals. The agent discovers subgoals based on commonalities across multiple paths to a solution. We cast the task of finding these commonalities as a multiple-instance learning problem and use the concept of diverse density to find solutions. We illustrate this approach using several gridworld tasks.
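To make the diverse density idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each trajectory is a bag of visited states, that successful trajectories are positive bags and unsuccessful ones negative bags, and that bags are scored with a common noisy-or formulation of diverse density with a Gaussian-like match probability. The function names, the `scale` parameter, and the toy trajectories are all hypothetical and chosen only for illustration.

```python
import math

def instance_prob(state, target, scale=1.0):
    """Assumed match model: exp(-scale * squared distance) between two grid cells."""
    dist_sq = sum((s - t) ** 2 for s, t in zip(state, target))
    return math.exp(-scale * dist_sq)

def diverse_density(target, positive_bags, negative_bags, scale=1.0):
    """Noisy-or diverse density of a candidate target state.

    High when every positive bag has a state near the target
    and no negative bag does.
    """
    dd = 1.0
    for bag in positive_bags:
        # Probability that no state in this bag matches the target.
        miss = 1.0
        for state in bag:
            miss *= 1.0 - instance_prob(state, target, scale)
        dd *= 1.0 - miss
    for bag in negative_bags:
        # Every state in a negative bag should fail to match the target.
        for state in bag:
            dd *= 1.0 - instance_prob(state, target, scale)
    return dd

def best_subgoal(positive_bags, negative_bags, scale=1.0):
    """Return the state seen in positive bags with the highest diverse density."""
    candidates = {state for bag in positive_bags for state in bag}
    return max(
        candidates,
        key=lambda c: diverse_density(c, positive_bags, negative_bags, scale),
    )

# Hypothetical toy data: (x, y) cells in a small gridworld where
# all successful paths cross the cell (2, 1), e.g. a doorway.
positive_bags = [
    [(0, 0), (1, 0), (2, 1), (3, 1), (4, 2)],
    [(0, 2), (1, 2), (2, 1), (3, 0), (4, 0)],
    [(0, 1), (1, 1), (2, 1), (3, 2), (4, 1)],
]
negative_bags = [
    [(0, 0), (0, 1), (1, 0), (1, 1)],
    [(0, 2), (1, 2), (1, 1)],
]

subgoal = best_subgoal(positive_bags, negative_bags, scale=5.0)
print(subgoal)
```

On this toy data the highest-scoring candidate is the shared doorway cell, since it is the only state that appears in every positive bag and in no negative bag. An exhaustive search over visited states is used here for simplicity; it is a design choice of this sketch, not something stated in the abstract.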
Total citations
[Chart: citations per year, 2001 to 2024]