Complexity of finite-horizon Markov decision process problems

M Mundhenk, J Goldsmith, C Lusena… - Journal of the ACM …, 2000 - dl.acm.org
Controlled stochastic systems occur in science engineering, manufacturing, social sciences,
and many other cntexts. If the systems is modeled as a Markov decision process (MDP) and …

The complexity of planning with partially-observable Markov Decision Processes

M Mundhenk - 2000 - digitalcommons.dartmouth.edu
This work surveys results on the complexity of planning under uncertainty. The planning
model considered is the partially-observable Markov decision process. The general …

Teaching robots to plan through Q-learning

D Gu, H Hu - Robotica, 2005 - cambridge.org
This paper presents a Q-learning approach to state-based planning of behaviour-based
walking robots. The learning process consists of a teaching stage and an autonomous …

Monte carlo localization for mobile robots in dynamic environments

A Bansail - 2002 - ttu-ir.tdl.org
Mobile robot localization is the problem of determining a robot's pose from sensor data. This
thesis presents a family of probabilistic localization algorithms known as Monte Carlo …

Evaluating robustness in a two layer simulated robot architecture

LD Pyeatt, AE Howe - Journal of Experimental & Theoretical …, 2000 - Taylor & Francis
Many two layer robot architectures have been proposed and implemented. While
justification for the design can be well argued, how does one know it is really a good idea …

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

SM Kalami Heris, MB Naghibi Sistani… - … and Applications. With …, 2009 - Springer
Abstract Markov Decision Process (MDP) has enormous applications in science,
engineering, economics and management. Most of decision processes have Markov …

Reinforcement learning in the control of a simulated life support system

TM Quasny - 2003 - ttu-ir.tdl.org
Since the 1970s, the National Aeronautics and Space Administration (NASA) has been
conducting experiments to improve the duration and safety of manned space missions. For …

بررسی یادگیری تقویتی و خواص سیاست بهینه در مسایل جدولی با استفاده از روش های کنترل دیجیتال

کلامی هریس سیدمصطفی, پریز ناصر… - 2009‎ - sid.ir
فرآیند تصمیم گیری مارکوف یا MDP, یکی از مسایلی است که دارای کاربردهای وسیعی در زمینه
های مختلف علمی, مهندسی, اقتصادی و مدیریت است. بسیاری از فرآیندهای تصمیم گیری, دارای …

[PDF][PDF] Performance of a Single Action Partially Observable Markov Decision Process in a Recognition Task

BLMTM Quasny, LDPED Sinzinger - researchgate.net
Abstract Partially Observable Markov Decision Processes (POMDPs) have been applied
extensively to planning in environments where knowledge of an underlying process is …

Real time Markov localization for mobile robots using pre-computation of sensor model

S Kona - 2002 - ttu-ir.tdl.org
Localization, that is the estimation of a robot's location from sensor data, is a fundamental
problem in mobile robotics. This thesis presents a version of Markov Localization that …