作者
Scott Sanner
发表日期
2010
期刊
Unpublished ms. Australian National University
卷号
32
页码范围
27
简介
The Relational Dynamic Influence Diagram Language (RDDL) is a uniform language where states, actions, and observations (whether discrete or continuous) are parameterized variables and the evolution of a fully or partially observed (stochastic) process is specified via (stochastic) functions over next state variables conditioned on current state and action variables (nb, concurrency is allowed). Parameterized variables are simply templates for ground variables that can be obtained when given a particular problem instance defining possible domain objects. Semantically, RDDL is simply a dynamic Bayes net (DBN)[1](with potentially many intermediate layers) extended with a simple influence diagram (ID)[2] utility node representing immediate reward. An objective function specifies how these immediate rewards should be optimized over time for optimal control. For a ground instance, RDDL is just a factored MDP (or POMDP, if partially observed).
引用总数
2010201120122013201420152016201720182019202020212022202320241216251823252623253522262513
学术搜索中的文章