related:uk_lcPLSDcoJ:scholar.google.com/

[PDF][PDF] Roles of macro-actions in accelerating reinforcement learning

A McGovern, RS Sutton, AH Fagg - Grace Hopper celebration …, 1997 - incompleteideas.net

We analyze the use of built-in policies, or macro-actions, as a form of domain knowledge
that can improve the speed and scaling of reinforcement learning algorithms. Such macro …

被引用次数：137 相关文章所有 17 个版本

[PDF] researchgate.net

[PDF][PDF] acQuire-macros: An algorithm for automatically learning macro-actions

A McGovern - Proceedings of the NIPS'98 Workshop on …, 1998 - researchgate.net

We present part of a new algorithm for automatically growing macro-actions online in a
reinforcement learning framework. We call this algorithm acQuire-macros. We present …

被引用次数：37 相关文章所有 8 个版本

[PDF] umass.edu

[PDF][PDF] Macro-actions in reinforcement learning: An empirical analysis

A McGovern, RS Sutton - Computer Science Department …, 1998 - scholarworks.umass.edu

Several researchers have proposed reinforcement learning methods that obtain advantages
in learning by using temporally extended actions, or macro-actions, but none has carefully …

被引用次数：83 相关文章所有 15 个版本

[PDF] bstu.by

Learning options in reinforcement learning

M Stolle, D Precup - … 5th International Symposium, SARA 2002 Kananaskis …, 2002 - Springer

Temporally extended actions (eg, macro actions) have proven very useful for speeding up
learning, ensuring robustness and building prior knowledge into AI systems. The options …

被引用次数：470 相关文章所有 19 个版本

[PDF] neurips.cc

Finding structure in reinforcement learning

S Thrun, A Schwartz - Advances in neural information …, 1994 - proceedings.neurips.cc

Reinforcement learning addresses the problem of learning to select actions in order to
maximize one's performance in unknown environments. To scale reinforcement learning to …

被引用次数：302 相关文章所有 19 个版本

[PDF] marcpickett.com

[PDF][PDF] Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning

M Pickett, AG Barto - ICML, 2002 - marcpickett.com

We present PolicyBlocks, an algorithm by which a reinforcement learning agent can extract
useful macro-actions from a set of related tasks. The agent creates macroactions by finding …

被引用次数：146 相关文章所有 3 个版本

[PDF] neurips.cc

Learning macro-actions in reinforcement learning

J Randlov - Advances in Neural Information Processing …, 1998 - proceedings.neurips.cc

We present a method for automatically constructing macro-actions from scratch from
primitive actions during the reinforcement learning process. The overall idea is to reinforce …

被引用次数：37 相关文章所有 7 个版本

[PDF] psu.edu

Reward functions for accelerated learning

MJ Mataric - Machine learning proceedings 1994, 1994 - Elsevier

This paper discusses why traditional reinforcement learning methods, and algorithms
applied to those models, result in poor performance in situated domains characterized by …

被引用次数：660 相关文章所有 9 个版本

[PDF] cas.cz

[PDF][PDF] Supervised learning combined with an actor-critic architecture

MT Rosenstein, AG Barto - Department of Computer Science …, 2002 - mys.utia.cas.cz

To address the shortcomings of reinforcement learning (RL) a number of researchers have
focused recently on ways to take advantage of structure in RL problems and on ways to …

被引用次数：39 相关文章所有 4 个版本

[PDF] bstu.by

[PDF][PDF] Exploration and exploitation in reinforcement learning

M Coggan - Research supervised by Prof. Doina Precup, CRA-W …, 2004 - neuro.bstu.by

A common problem in reinforcement learning is finding a balance between exploration
(attempting to discover new features about the world by a selecting sub-optimal action) and …

被引用次数：112 相关文章所有 5 个版本

高级搜索

QQ 群