Size independent neural transfer for rddl planning

S Garg, A Bajpai - Proceedings of the International Conference on …, 2019 - aaai.org
Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …

Transfer of deep reactive policies for mdp planning

AN Bajpai, S Garg - Advances in Neural Information …, 2018 - proceedings.neurips.cc
Abstract Domain-independent probabilistic planners input an MDP description in a factored
representation language such as PPDDL or RDDL, and exploit the specifics of the …

Training deep reactive policies for probabilistic planning problems

M Issakkimuthu, A Fern, P Tadepalli - Proceedings of the International …, 2018 - ojs.aaai.org
State-of-the-art probabilistic planners typically apply look-ahead search and reasoning at
each step to make a decision. While this approach can enable high-quality decisions, it can …

Learning generalized reactive policies using deep neural networks

E Groshev, M Goldstein, A Tamar… - Proceedings of the …, 2018 - ojs.aaai.org
We present a new approach to learning for planning, where knowledge acquired while
solving a given set of planning problems is used to plan faster in related, but new problem …

Reinforcement learning for classical planning: Viewing heuristics as dense reward generators

C Gehring, M Asai, R Chitnis, T Silver… - Proceedings of the …, 2022 - ojs.aaai.org
Recent advances in reinforcement learning (RL) have led to a growing interest in applying
RL to classical planning domains or applying classical planning methods to some complex …

Asnets: Deep learning for generalised planning

S Toyer, S Thiébaux, F Trevizan, L Xie - Journal of Artificial Intelligence …, 2020 - jair.org
In this paper, we discuss the learning of generalised policies for probabilistic and classical
planning problems using Action Schema Networks (ASNets). The ASNet is a neural network …

Action schema networks: Generalised policies with deep learning

S Toyer, F Trevizan, S Thiébaux, L Xie - Proceedings of the AAAI …, 2018 - ojs.aaai.org
In this paper, we introduce the Action Schema Network (ASNet): a neural network
architecture for learning generalised policies for probabilistic planning problems. By …

Deep reactive policies for planning in stochastic nonlinear domains

TP Bueno, LN de Barros, DD Mauá… - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Recent advances in applying deep learning to planning have shown that Deep Reactive
Policies (DRPs) can be powerful for fast decision-making in complex environments …

World model as a graph: Learning latent landmarks for planning

L Zhang, G Yang, BC Stadie - International conference on …, 2021 - proceedings.mlr.press
Planning, the ability to analyze the structure of a problem in the large and decompose it into
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …

Continuous neural algorithmic planners

Y He, P Veličković, P Liò… - Learning on Graphs …, 2022 - proceedings.mlr.press
Neural algorithmic reasoning studies the problem of learning algorithms with neural
networks, especially using graph architectures. A recent proposal, XLVIN, reaps the benefits …