作者
Karina Valdivia Delgado, Scott Sanner, Leliane Nunes De Barros
发表日期
2011/6/1
期刊
Artificial Intelligence
卷号
175
期号
9-10
页码范围
1498-1527
出版商
Elsevier
简介
When modeling real-world decision-theoretic planning problems in the Markov Decision Process (MDP) framework, it is often impossible to obtain a completely accurate estimate of transition probabilities. For example, natural uncertainty arises in the transition specification due to elicitation of MDP transition models from an expert or estimation from data, or non-stationary transition distributions arising from insufficient state knowledge. In the interest of obtaining the most robust policy under transition uncertainty, the Markov Decision Process with Imprecise Transition Probabilities (MDP-IPs) has been introduced to model such scenarios. Unfortunately, while various solution algorithms exist for MDP-IPs, they often require external calls to optimization routines and thus can be extremely time-consuming in practice. To address this deficiency, we introduce the factored MDP-IP and propose efficient dynamic programming …
引用总数
20102011201220132014201520162017201820192020202120222023202444813973879128424
学术搜索中的文章