A heuristic variable grid solution method for POMDPs

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org

Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

被引用次数：1600 相关文章所有 27 个版本

[PDF] robot.cc

[PDF][PDF] Point-based value iteration: An anytime algorithm for POMDPs

J Pineau, G Gordon, S Thrun - Ijcai, 2003 - fore.robot.cc

(PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution
by selecting a small set of representative belief points and then tracking the value and its …

被引用次数：1502 相关文章所有 25 个版本

[PDF] google.com

Optimization methods to solve adaptive management problems

I Chadès, S Nicol, TM Rout, M Péron, Y Dujardin… - Theoretical …, 2017 - Springer

Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …

被引用次数：68 相关文章所有 12 个版本

[PDF] jair.org

Perseus: Randomized point-based value iteration for POMDPs

MTJ Spaan, N Vlassis - Journal of artificial intelligence research, 2005 - jair.org

Partially observable Markov decision processes (POMDPs) form an attractive and principled
framework for agent planning under uncertainty. Point-based approximate techniques for …

被引用次数：943 相关文章所有 22 个版本

[PDF] jair.org

Value-function approximations for partially observable Markov decision processes

M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org

Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …

被引用次数：790 相关文章所有 28 个版本

[PDF] arxiv.org

Heuristic search value iteration for POMDPs

T Smith, R Simmons - arXiv preprint arXiv:1207.4166, 2012 - arxiv.org

We present a novel POMDP planning algorithm called heuristic search value iteration
(HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret …

被引用次数：702 相关文章所有 17 个版本

[PDF] hal.science

The hidden information state model: A practical framework for POMDP-based spoken dialogue management

S Young, M Gašić, S Keizer, F Mairesse… - Computer Speech & …, 2010 - Elsevier

This paper explains how Partially Observable Markov Decision Processes (POMDPs) can
provide a principled mathematical framework for modelling the inherent uncertainty in …

被引用次数：610 相关文章所有 24 个版本

[PDF] psu.edu

The Cog project: Building a humanoid robot

RA Brooks, C Breazeal, M Marjanović… - … on computation for …, 1998 - Springer

To explore issues of developmental structure, physical embodiment, integration of multiple
sensory and motor systems, and social interaction, we have constructed an upper-torso …

被引用次数：880 相关文章所有 31 个版本

[PDF] tudelft.nl

Partially observable Markov decision processes

MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer

For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …

被引用次数：453 相关文章所有 16 个版本

[PDF] jair.org

Anytime point-based approximations for large POMDPs

J Pineau, G Gordon, S Thrun - Journal of Artificial Intelligence Research, 2006 - jair.org

Abstract The Partially Observable Markov Decision Process has long been recognized as a
rich framework for real-world planning and control problems, especially in robotics. However …

被引用次数：531 相关文章所有 11 个版本

高级搜索

QQ 群