POMCP-based decentralized spatial task allocation algorithms for partially observable environments

Learning logic specifications for policy guidance in POMDPs: an inductive logic programming approach

D Meli, A Castellini, A Farinelli - Journal of Artificial Intelligence Research, 2024 - jair.org

Abstract: Partially Observable Markov Decision Processes (POMDPs) are a powerful
framework for planning under uncertainty. They allow to model state uncertainty as a belief …

Cited by 7 - Related articles - All 8 versions

Information-guided planning: an online approach for partially observable problems

MA do Carmo Alves, A Varma… - Advances in …, 2023 - proceedings.neurips.cc

This paper presents IB-POMCP, a novel algorithm for online planning under partial
observability. Our approach enhances the decision-making process by using estimations of …

Decoupled Monte Carlo Tree Search for Cooperative Multi-Agent Planning

O Asik, FB Aydemir, HL Akın - Applied Sciences, 2023 - mdpi.com

The number of agents exponentially increases the complexity of a cooperative multi-agent
planning problem. Decoupled planning is one of the viable approaches to reduce this …

Cited by 2 - Related articles - All 5 versions

[PDF] USING PARTICLE REPRESENTATION OF BELIEFS IN AN ALPHAZERO-BASED REINFORCEMENT LEARNING ALGORITHM FOR PARTIALLY …

H Itoh, Y Kihara, H Fukumoto, H Wakuya - icicel.org

The AlphaZero algorithm is a general reinforcement learning algorithm that defeated world-
champion level programs in chess, shogi, and Go without prior knowledge. However, the …
