Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

[PDF][PDF] Point-based value iteration: An anytime algorithm for POMDPs

J Pineau, G Gordon, S Thrun - Ijcai, 2003 - fore.robot.cc
(PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution
by selecting a small set of representative belief points and then tracking the value and its …

Optimization methods to solve adaptive management problems

I Chadès, S Nicol, TM Rout, M Péron, Y Dujardin… - Theoretical …, 2017 - Springer
Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …

Perseus: Randomized point-based value iteration for POMDPs

MTJ Spaan, N Vlassis - Journal of artificial intelligence research, 2005 - jair.org
Partially observable Markov decision processes (POMDPs) form an attractive and principled
framework for agent planning under uncertainty. Point-based approximate techniques for …

Value-function approximations for partially observable Markov decision processes

M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org
Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …

Heuristic search value iteration for POMDPs

T Smith, R Simmons - arXiv preprint arXiv:1207.4166, 2012 - arxiv.org
We present a novel POMDP planning algorithm called heuristic search value iteration
(HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret …

The hidden information state model: A practical framework for POMDP-based spoken dialogue management

S Young, M Gašić, S Keizer, F Mairesse… - Computer Speech & …, 2010 - Elsevier
This paper explains how Partially Observable Markov Decision Processes (POMDPs) can
provide a principled mathematical framework for modelling the inherent uncertainty in …

The Cog project: Building a humanoid robot

RA Brooks, C Breazeal, M Marjanović… - … on computation for …, 1998 - Springer
To explore issues of developmental structure, physical embodiment, integration of multiple
sensory and motor systems, and social interaction, we have constructed an upper-torso …

Partially observable Markov decision processes

MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer
For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …

Anytime point-based approximations for large POMDPs

J Pineau, G Gordon, S Thrun - Journal of Artificial Intelligence Research, 2006 - jair.org
Abstract The Partially Observable Markov Decision Process has long been recognized as a
rich framework for real-world planning and control problems, especially in robotics. However …