查看文章

jair.org 中的 [PDF]

Optimally Solving Dec-POMDPs as Continuous-State MDPs

作者

Jilles Steeve Dibangoye, Christopher Amato, Olivier Buffet, François Charpillet

发表日期

2013

期刊

Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (IJCAI-13)}

简介

Decentralized partially observable Markov decision processes (Dec-POMDPs) provide a general model for decision-making under uncertainty in decentralized settings, but are difficult to solve optimally (NEXP-Complete). As a new way of solving these problems, we introduce the idea of transforming a Dec-POMDP into a continuous-state deterministic MDP with a piecewise-linear and convex value function. This approach makes use of the fact that planning can be accomplished in a centralized offline manner, while execution can still be decentralized. This new Dec-POMDP formulation, which we call an occupancy MDP, allows powerful POMDP and continuous-state MDP methods to be used for the first time. To provide scalability, we refine this approach by combining heuristic search and compact representations that exploit the structure present in multi-agent domains, without losing the ability to converge to an optimal solution. In particular, we introduce a feature-based heuristic search value iteration (FB-HSVI) algorithm that relies on feature-based compact representations, point-based updates and efficient action selection. A theoretical analysis demonstrates that FB-HSVI terminates in finite time with an optimal solution. We include an extensive empirical analysis using well-known benchmarks, thereby demonstrating that our approach provides significant scalability improvements compared to the state of the art.

引用总数

被引用次数：163

201520162017201820192020202120222023202413 10 11 19 11 22 16 15 26 10

学术搜索中的文章

Optimally solving Dec-POMDPs as continuous-state MDPs

JS Dibangoye, C Amato, O Buffet, F Charpillet - Journal of Artificial Intelligence Research, 2016

被引用次数：163 相关文章所有 28 个版本

Résolution exacte des Dec-POMDPs comme des MDPs continus*

JS Dibangoye, C Amato, O Buffet, F Charpillet - 8èmes Journées Francophones sur la Planification, la …, 2013