Faster stackelberg planning via symbolic search and information sharing

Á Torralba, P Speicher, R Künnemann… - Proceedings of the …, 2021 - ojs.aaai.org
Stackelberg planning is a recent framework where a leader and a follower each choose a
plan in the same planning task, the leader's objective being to maximize plan cost for the …

Globally optimal hierarchical reinforcement learning for linearly-solvable markov decision processes

G Infante, A Jonsson, V Gómez - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
We present a novel approach to hierarchical reinforcement learning for linearly-solvable
Markov decision processes. Our approach assumes that the state space is partitioned, and …

Pattern Databases for Stochastic Shortest Path Problems

T Klößner, J Hoffmann - Proceedings of the International Symposium on …, 2021 - ojs.aaai.org
Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which
heuristic search algorithms exist since over a decade. Yet most known heuristic functions …

[HTML][HTML] State space search nogood learning: Online refinement of critical-path dead-end detectors in planning

M Steinmetz, J Hoffmann - Artificial Intelligence, 2017 - Elsevier
Conflict-directed learning is ubiquitous in constraint satisfaction problems like SAT, but has
been elusive for state space search on reachability problems like classical planning. Almost …

A theory of merge-and-shrink for stochastic shortest path problems

T Klößner, Á Torralba, M Steinmetz… - Proceedings of the …, 2023 - ojs.aaai.org
The merge-and-shrink framework is a powerful tool to construct state space abstractions
based on factored representations. One of its core applications in classical planning is the …

Hybrid mission planning with coalition formation

A Dukeman, JA Adams - Autonomous Agents and Multi-Agent Systems, 2017 - Springer
The increase in robotic capabilities and the number of such systems being used has
resulted in opportunities for robots to work alongside humans in an increasing number of …

Towards clause-learning state space search: Learning to recognize dead-ends

M Steinmetz, J Hoffmann - Proceedings of the AAAI Conference on …, 2016 - ojs.aaai.org
We introduce a state space search method that identifies dead-end states, analyzes the
reasons for failure, and learns to avoid similar mistakes in the future. Our work is placed in …

Directed fixed-point regression-based planning for non-deterministic domains

M Ramirez, S Sardina - Proceedings of the International Conference on …, 2014 - ojs.aaai.org
We present a novel approach to fully-observable nondeterministic planning (FOND) that
attempts to bridge the gap between symbolic fix-point computation and recent approaches …

Revisiting goal probability analysis in probabilistic planning

M Steinmetz, J Hoffmann, O Buffet - Proceedings of the International …, 2016 - ojs.aaai.org
Maximizing goal probability is an important objective in probabilistic planning, yet algorithms
for its optimal solution are severely underexplored. There is scant evidence of what the …

Classical planning in MDP heuristics: With a little help from generalization

A Kolobov, D Weld - Proceedings of the International Conference on …, 2010 - ojs.aaai.org
Computing a good policy in stochastic uncertain environments with unknown dynamics and
reward model parameters is a challenging task. In a number of domains, ranging from space …