[图书][B] Integration of partially observable Markov decision processes and reinforcement learning for simulated robot navigation

LD Pyeatt - 1999 - search.proquest.com
This dissertation presents a two level architecture for goal-directed robot control. The low
level actions are learned on-line as the robot performs its tasks, thereby reducing the need …

[PDF][PDF] Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

LD Pyeatt, AE Howe - Proceedings of the third annual conference on …, 1999 - dl.acm.org
Two layer control systems are common in robot architectures. The lower level is designed to
provide fast, fine grained control while the higher level plans longer term sequences of …

Learning how to combine sensory-motor functions into a robust behavior

B Morisset, M Ghallab - Artificial intelligence, 2008 - Elsevier
This article describes a system, called Robel, for defining a robot controller that learns from
experience very robust ways of performing a high-level task such as “navigate to”. The …

[PDF][PDF] Designing agent controllers using discrete-event Markov models

S Mahadevan, N Khaleeli, N Marchalleck - … of the AAAI Fall Symposium on …, 1997 - Citeseer
This paper describes the use of discrete-event Markov decision process models to design
robust agent controllers in complex stochastic domains. Unlike discrete-time models, where …

[PDF][PDF] Self-organizing perceptual and temporal abstraction for robot reinforcement learning

J Provost, BJ Kuipers, R Miikkulainen - AAAI Workshop on Learning …, 2004 - cdn.aaai.org
A major current challenge in reinforcement learning research is to extend methods that work
well on discrete, short-range, low-dimensional problems to continuous, highdiameter, high …

An architecture for behavior-based reinforcement learning

GD Konidaris, GM Hayes - Adaptive Behavior, 2005 - journals.sagepub.com
This paper introduces an integration of reinforcement learning and behavior-based control
designed to produce real-time learning in situated agents. The model layers a distributed …

[PDF][PDF] Reinforcement learning in non-markov environments

SD Whitehead, LJ Lin - Artificial Intelligence. Submitted, 1993 - Citeseer
Recently, techniques based on reinforcement learning (RL) have been used to build
systems that learn to perform non-trivial sequential decision tasks. To date, most of this work …

Learning multiple goal behavior via task decomposition and dynamic policy merging

S Whitehead, J Karlsson, J Tenenberg - Robot learning, 1993 - Springer
An ability to coordinate the pursuit of multiple, time-varying goals is important to an
intelligent robot. In this chapter we consider the application of reinforcement learning to a …

[PDF][PDF] Learning qualitative Markov decision processes

A Reyes, LE Sucar, E Morales… - … systems nips 2005 …, 2005 - researchgate.net
To navigate in natural environments, a robot must decide the best action to take according to
its current situation and goal, a problem that can be represented as a Markov Decision …

[PDF][PDF] Learning robot control-using control policies as abstract actions

M Huber, RA Grupen - Proceedings of the NIPS'98 Workshop on …, 1998 - Citeseer
Autonomous robot systems operating in an uncertain environment have to be able to cope
with new situations and task requirements. Important properties of the control architecture of …