相关文章- 学术资源搜索

[图书][B] Integration of partially observable Markov decision processes and reinforcement learning for simulated robot navigation

LD Pyeatt - 1999 - search.proquest.com

This dissertation presents a two level architecture for goal-directed robot control. The low
level actions are learned on-line as the robot performs its tasks, thereby reducing the need …

被引用次数：14 相关文章所有 4 个版本

[PDF] acm.org

[PDF][PDF] Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

LD Pyeatt, AE Howe - Proceedings of the third annual conference on …, 1999 - dl.acm.org

Two layer control systems are common in robot architectures. The lower level is designed to
provide fast, fine grained control while the higher level plans longer term sequences of …

被引用次数：21 相关文章所有 5 个版本

[PDF] sciencedirect.com

Learning how to combine sensory-motor functions into a robust behavior

B Morisset, M Ghallab - Artificial intelligence, 2008 - Elsevier

This article describes a system, called Robel, for defining a robot controller that learns from
experience very robust ways of performing a high-level task such as “navigate to”. The …

被引用次数：37 相关文章所有 8 个版本

[PDF] psu.edu

[PDF][PDF] Designing agent controllers using discrete-event Markov models

S Mahadevan, N Khaleeli, N Marchalleck - … of the AAAI Fall Symposium on …, 1997 - Citeseer

This paper describes the use of discrete-event Markov decision process models to design
robust agent controllers in complex stochastic domains. Unlike discrete-time models, where …

被引用次数：15 相关文章所有 3 个版本

[PDF] aaai.org

[PDF][PDF] Self-organizing perceptual and temporal abstraction for robot reinforcement learning

J Provost, BJ Kuipers, R Miikkulainen - AAAI Workshop on Learning …, 2004 - cdn.aaai.org

A major current challenge in reinforcement learning research is to extend methods that work
well on discrete, short-range, low-dimensional problems to continuous, highdiameter, high …

被引用次数：22 相关文章所有 9 个版本

[PDF] researchgate.net

An architecture for behavior-based reinforcement learning

GD Konidaris, GM Hayes - Adaptive Behavior, 2005 - journals.sagepub.com

This paper introduces an integration of reinforcement learning and behavior-based control
designed to produce real-time learning in situated agents. The model layers a distributed …

被引用次数：50 相关文章所有 13 个版本

[PDF] psu.edu

[PDF][PDF] Reinforcement learning in non-markov environments

SD Whitehead, LJ Lin - Artificial Intelligence. Submitted, 1993 - Citeseer

Recently, techniques based on reinforcement learning (RL) have been used to build
systems that learn to perform non-trivial sequential decision tasks. To date, most of this work …

被引用次数：8 相关文章

Learning multiple goal behavior via task decomposition and dynamic policy merging

S Whitehead, J Karlsson, J Tenenberg - Robot learning, 1993 - Springer

An ability to coordinate the pursuit of multiple, time-varying goals is important to an
intelligent robot. In this chapter we consider the application of reinforcement learning to a …

被引用次数：130 相关文章所有 5 个版本

[PDF] researchgate.net

[PDF][PDF] Learning qualitative Markov decision processes

A Reyes, LE Sucar, E Morales… - … systems nips 2005 …, 2005 - researchgate.net

To navigate in natural environments, a robot must decide the best action to take according to
its current situation and goal, a problem that can be represented as a Markov Decision …

被引用次数：3 相关文章

[PDF] psu.edu

[PDF][PDF] Learning robot control-using control policies as abstract actions

M Huber, RA Grupen - Proceedings of the NIPS'98 Workshop on …, 1998 - Citeseer

Autonomous robot systems operating in an uncertain environment have to be able to cope
with new situations and task requirements. Important properties of the control architecture of …

被引用次数：21 相关文章所有 4 个版本

高级搜索

QQ 群

[图书][B] Integration of partially observable Markov decision processes and reinforcement learning for simulated robot navigation

[PDF][PDF] Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

Learning how to combine sensory-motor functions into a robust behavior

[PDF][PDF] Designing agent controllers using discrete-event Markov models

[PDF][PDF] Self-organizing perceptual and temporal abstraction for robot reinforcement learning

An architecture for behavior-based reinforcement learning

[PDF][PDF] Reinforcement learning in non-markov environments

Learning multiple goal behavior via task decomposition and dynamic policy merging

[PDF][PDF] Learning qualitative Markov decision processes

[PDF][PDF] Learning robot control-using control policies as abstract actions

引用