Integration of partially observable Markov decision processes and reinforcement learning...

M Mundhenk, J Goldsmith, C Lusena… - Journal of the ACM …, 2000 - dl.acm.org

Controlled stochastic systems occur in science engineering, manufacturing, social sciences,
and many other cntexts. If the systems is modeled as a Markov decision process (MDP) and …

被引用次数：229 相关文章所有 17 个版本

[PDF] dartmouth.edu

The complexity of planning with partially-observable Markov Decision Processes

M Mundhenk - 2000 - digitalcommons.dartmouth.edu

This work surveys results on the complexity of planning under uncertainty. The planning
model considered is the partially-observable Markov decision process. The general …

被引用次数：10 相关文章所有 3 个版本

[PDF] academia.edu

Teaching robots to plan through Q-learning

D Gu, H Hu - Robotica, 2005 - cambridge.org

This paper presents a Q-learning approach to state-based planning of behaviour-based
walking robots. The learning process consists of a teaching stage and an autonomous …

被引用次数：9 相关文章所有 9 个版本

[PDF] tdl.org

Monte carlo localization for mobile robots in dynamic environments

A Bansail - 2002 - ttu-ir.tdl.org

Mobile robot localization is the problem of determining a robot's pose from sensor data. This
thesis presents a family of probabilistic localization algorithms known as Monte Carlo …

被引用次数：2 相关文章所有 3 个版本

Evaluating robustness in a two layer simulated robot architecture

LD Pyeatt, AE Howe - Journal of Experimental & Theoretical …, 2000 - Taylor & Francis

Many two layer robot architectures have been proposed and implemented. While
justification for the design can be well argued, how does one know it is really a good idea …

被引用次数：2 相关文章所有 2 个版本

[PDF] psu.edu

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

SM Kalami Heris, MB Naghibi Sistani… - … and Applications. With …, 2009 - Springer

Abstract Markov Decision Process (MDP) has enormous applications in science,
engineering, economics and management. Most of decision processes have Markov …

被引用次数：1 相关文章所有 12 个版本

[PDF] tdl.org

Reinforcement learning in the control of a simulated life support system

TM Quasny - 2003 - ttu-ir.tdl.org

Since the 1970s, the National Aeronautics and Space Administration (NASA) has been
conducting experiments to improve the duration and safety of manned space missions. For …

被引用次数：1 相关文章所有 7 个版本

[PDF] sid.ir

بررسی یادگیری تقویتی و خواص سیاست بهینه در مسایل جدولی با استفاده از روش های کنترل دیجیتال‎

کلامی هریس سیدمصطفی, پریز ناصر… - 2009‎ - sid.ir

فرآیند تصمیم گیری مارکوف یا MDP, یکی از مسایلی است که دارای کاربردهای وسیعی در زمینه
های مختلف علمی, مهندسی, اقتصادی و مدیریت است. بسیاری از فرآیندهای تصمیم گیری, دارای …‎

[PDF][PDF] Performance of a Single Action Partially Observable Markov Decision Process in a Recognition Task

BLMTM Quasny, LDPED Sinzinger - researchgate.net

Abstract Partially Observable Markov Decision Processes (POMDPs) have been applied
extensively to planning in environments where knowledge of an underlying process is …

[PDF] tdl.org

Real time Markov localization for mobile robots using pre-computation of sensor model

S Kona - 2002 - ttu-ir.tdl.org

Localization, that is the estimation of a robot's location from sensor data, is a fundamental
problem in mobile robotics. This thesis presents a version of Markov Localization that …

高级搜索

QQ 群