[PDF][PDF] Roles of macro-actions in accelerating reinforcement learning

A McGovern, RS Sutton, AH Fagg - Grace Hopper celebration …, 1997 - incompleteideas.net
We analyze the use of built-in policies, or macro-actions, as a form of domain knowledge
that can improve the speed and scaling of reinforcement learning algorithms. Such macro …

[PDF][PDF] acQuire-macros: An algorithm for automatically learning macro-actions

A McGovern - Proceedings of the NIPS'98 Workshop on …, 1998 - researchgate.net
We present part of a new algorithm for automatically growing macro-actions online in a
reinforcement learning framework. We call this algorithm acQuire-macros. We present …

[PDF][PDF] Macro-actions in reinforcement learning: An empirical analysis

A McGovern, RS Sutton - Computer Science Department …, 1998 - scholarworks.umass.edu
Several researchers have proposed reinforcement learning methods that obtain advantages
in learning by using temporally extended actions, or macro-actions, but none has carefully …

Learning options in reinforcement learning

M Stolle, D Precup - … 5th International Symposium, SARA 2002 Kananaskis …, 2002 - Springer
Temporally extended actions (eg, macro actions) have proven very useful for speeding up
learning, ensuring robustness and building prior knowledge into AI systems. The options …

Finding structure in reinforcement learning

S Thrun, A Schwartz - Advances in neural information …, 1994 - proceedings.neurips.cc
Reinforcement learning addresses the problem of learning to select actions in order to
maximize one's performance in unknown environments. To scale reinforcement learning to …

[PDF][PDF] Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning

M Pickett, AG Barto - ICML, 2002 - marcpickett.com
We present PolicyBlocks, an algorithm by which a reinforcement learning agent can extract
useful macro-actions from a set of related tasks. The agent creates macroactions by finding …

Learning macro-actions in reinforcement learning

J Randlov - Advances in Neural Information Processing …, 1998 - proceedings.neurips.cc
We present a method for automatically constructing macro-actions from scratch from
primitive actions during the reinforcement learning process. The overall idea is to reinforce …

Reward functions for accelerated learning

MJ Mataric - Machine learning proceedings 1994, 1994 - Elsevier
This paper discusses why traditional reinforcement learning methods, and algorithms
applied to those models, result in poor performance in situated domains characterized by …

[PDF][PDF] Supervised learning combined with an actor-critic architecture

MT Rosenstein, AG Barto - Department of Computer Science …, 2002 - mys.utia.cas.cz
To address the shortcomings of reinforcement learning (RL) a number of researchers have
focused recently on ways to take advantage of structure in RL problems and on ways to …

[PDF][PDF] Exploration and exploitation in reinforcement learning

M Coggan - Research supervised by Prof. Doina Precup, CRA-W …, 2004 - neuro.bstu.by
A common problem in reinforcement learning is finding a balance between exploration
(attempting to discover new features about the world by a selecting sub-optimal action) and …