- 学术资源搜索

Faster stackelberg planning via symbolic search and information sharing

Á Torralba, P Speicher, R Künnemann… - Proceedings of the …, 2021 - ojs.aaai.org

Stackelberg planning is a recent framework where a leader and a follower each choose a
plan in the same planning task, the leader's objective being to maximize plan cost for the …

被引用次数：12 相关文章所有 8 个版本

[PDF] aaai.org

Globally optimal hierarchical reinforcement learning for linearly-solvable markov decision processes

G Infante, A Jonsson, V Gómez - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

We present a novel approach to hierarchical reinforcement learning for linearly-solvable
Markov decision processes. Our approach assumes that the state space is partitioned, and …

被引用次数：9 相关文章所有 8 个版本

[PDF] aaai.org

Pattern Databases for Stochastic Shortest Path Problems

T Klößner, J Hoffmann - Proceedings of the International Symposium on …, 2021 - ojs.aaai.org

Stochastic shortest-path problems (SSP) are an important subclass of MDPs for which
heuristic search algorithms exist since over a decade. Yet most known heuristic functions …

被引用次数：11 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] State space search nogood learning: Online refinement of critical-path dead-end detectors in planning

M Steinmetz, J Hoffmann - Artificial Intelligence, 2017 - Elsevier

Conflict-directed learning is ubiquitous in constraint satisfaction problems like SAT, but has
been elusive for state space search on reachability problems like classical planning. Almost …

被引用次数：28 相关文章所有 6 个版本

[PDF] aaai.org

A theory of merge-and-shrink for stochastic shortest path problems

T Klößner, Á Torralba, M Steinmetz… - Proceedings of the …, 2023 - ojs.aaai.org

The merge-and-shrink framework is a powerful tool to construct state space abstractions
based on factored representations. One of its core applications in classical planning is the …

被引用次数：3 相关文章所有 9 个版本

[PDF] vanderbilt.edu

Hybrid mission planning with coalition formation

A Dukeman, JA Adams - Autonomous Agents and Multi-Agent Systems, 2017 - Springer

The increase in robotic capabilities and the number of such systems being used has
resulted in opportunities for robots to work alongside humans in an increasing number of …

被引用次数：23 相关文章所有 6 个版本

[PDF] aaai.org

Towards clause-learning state space search: Learning to recognize dead-ends

M Steinmetz, J Hoffmann - Proceedings of the AAAI Conference on …, 2016 - ojs.aaai.org

We introduce a state space search method that identifies dead-end states, analyzes the
reasons for failure, and learns to avoid similar mistakes in the future. Our work is placed in …

被引用次数：26 相关文章所有 11 个版本

[PDF] aaai.org

Directed fixed-point regression-based planning for non-deterministic domains

M Ramirez, S Sardina - Proceedings of the International Conference on …, 2014 - ojs.aaai.org

We present a novel approach to fully-observable nondeterministic planning (FOND) that
attempts to bridge the gap between symbolic fix-point computation and recent approaches …

被引用次数：23 相关文章所有 8 个版本

[PDF] aaai.org

Revisiting goal probability analysis in probabilistic planning

M Steinmetz, J Hoffmann, O Buffet - Proceedings of the International …, 2016 - ojs.aaai.org

Maximizing goal probability is an important objective in probabilistic planning, yet algorithms
for its optimal solution are severely underexplored. There is scant evidence of what the …

被引用次数：22 相关文章所有 13 个版本

[PDF] aaai.org

Classical planning in MDP heuristics: With a little help from generalization

A Kolobov, D Weld - Proceedings of the International Conference on …, 2010 - ojs.aaai.org

Computing a good policy in stochastic uncertain environments with unknown dynamics and
reward model parameters is a challenging task. In a number of domains, ranging from space …

被引用次数：30 相关文章所有 14 个版本

高级搜索

QQ 群

Faster stackelberg planning via symbolic search and information sharing

Globally optimal hierarchical reinforcement learning for linearly-solvable markov decision processes

Pattern Databases for Stochastic Shortest Path Problems

[HTML][HTML] State space search nogood learning: Online refinement of critical-path dead-end detectors in planning

A theory of merge-and-shrink for stochastic shortest path problems

Hybrid mission planning with coalition formation

Towards clause-learning state space search: Learning to recognize dead-ends

Directed fixed-point regression-based planning for non-deterministic domains

Revisiting goal probability analysis in probabilistic planning

Classical planning in MDP heuristics: With a little help from generalization

引用