相关文章- 学术资源搜索

Size independent neural transfer for rddl planning

S Garg, A Bajpai - Proceedings of the International Conference on …, 2019 - aaai.org

Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …

被引用次数：48 相关文章所有 4 个版本

[PDF] neurips.cc

Transfer of deep reactive policies for mdp planning

AN Bajpai, S Garg - Advances in Neural Information …, 2018 - proceedings.neurips.cc

Abstract Domain-independent probabilistic planners input an MDP description in a factored
representation language such as PPDDL or RDDL, and exploit the specifics of the …

被引用次数：26 相关文章所有 5 个版本

[PDF] aaai.org

Training deep reactive policies for probabilistic planning problems

M Issakkimuthu, A Fern, P Tadepalli - Proceedings of the International …, 2018 - ojs.aaai.org

State-of-the-art probabilistic planners typically apply look-ahead search and reasoning at
each step to make a decision. While this approach can enable high-quality decisions, it can …

被引用次数：42 相关文章所有 5 个版本

[PDF] aaai.org

Learning generalized reactive policies using deep neural networks

E Groshev, M Goldstein, A Tamar… - Proceedings of the …, 2018 - ojs.aaai.org

We present a new approach to learning for planning, where knowledge acquired while
solving a given set of planning problems is used to plan faster in related, but new problem …

被引用次数：135 相关文章所有 15 个版本

[PDF] aaai.org

Reinforcement learning for classical planning: Viewing heuristics as dense reward generators

C Gehring, M Asai, R Chitnis, T Silver… - Proceedings of the …, 2022 - ojs.aaai.org

Recent advances in reinforcement learning (RL) have led to a growing interest in applying
RL to classical planning domains or applying classical planning methods to some complex …

被引用次数：36 相关文章所有 9 个版本

[PDF] jair.org

Asnets: Deep learning for generalised planning

S Toyer, S Thiébaux, F Trevizan, L Xie - Journal of Artificial Intelligence …, 2020 - jair.org

In this paper, we discuss the learning of generalised policies for probabilistic and classical
planning problems using Action Schema Networks (ASNets). The ASNet is a neural network …

被引用次数：78 相关文章所有 12 个版本

[PDF] aaai.org

Action schema networks: Generalised policies with deep learning

S Toyer, F Trevizan, S Thiébaux, L Xie - Proceedings of the AAAI …, 2018 - ojs.aaai.org

In this paper, we introduce the Action Schema Network (ASNet): a neural network
architecture for learning generalised policies for probabilistic planning problems. By …

被引用次数：117 相关文章所有 16 个版本

[PDF] aaai.org

Deep reactive policies for planning in stochastic nonlinear domains

TP Bueno, LN de Barros, DD Mauá… - Proceedings of the AAAI …, 2019 - ojs.aaai.org

Recent advances in applying deep learning to planning have shown that Deep Reactive
Policies (DRPs) can be powerful for fast decision-making in complex environments …

被引用次数：32 相关文章所有 9 个版本

[PDF] mlr.press

World model as a graph: Learning latent landmarks for planning

L Zhang, G Yang, BC Stadie - International conference on …, 2021 - proceedings.mlr.press

Planning, the ability to analyze the structure of a problem in the large and decompose it into
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …

被引用次数：74 相关文章所有 7 个版本

[PDF] mlr.press

Continuous neural algorithmic planners

Y He, P Veličković, P Liò… - Learning on Graphs …, 2022 - proceedings.mlr.press

Neural algorithmic reasoning studies the problem of learning algorithms with neural
networks, especially using graph architectures. A recent proposal, XLVIN, reaps the benefits …

被引用次数：6 相关文章所有 5 个版本

高级搜索

QQ 群

Size independent neural transfer for rddl planning

Transfer of deep reactive policies for mdp planning

Training deep reactive policies for probabilistic planning problems

Learning generalized reactive policies using deep neural networks

Reinforcement learning for classical planning: Viewing heuristics as dense reward generators

Asnets: Deep learning for generalised planning

Action schema networks: Generalised policies with deep learning

Deep reactive policies for planning in stochastic nonlinear domains

World model as a graph: Learning latent landmarks for planning

Continuous neural algorithmic planners

引用