Size independent neural transfer for rddl planning

S Garg, A Bajpai - Proceedings of the International Conference on …, 2019 - aaai.org
Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …

Delete relaxations for planning with state-dependent action costs

F Geißer, T Keller, R Mattmüller - Proceedings of the International …, 2015 - ojs.aaai.org
Supporting state-dependent action costs in planning admits a more compact representation
of many tasks. We generalize the additive heuristic and compute it by embedding decision …

An English-language argumentation interface for explanation generation with Markov decision processes in the domain of academic advising

T Dodson, N Mattei, JT Guerin… - ACM Transactions on …, 2013 - dl.acm.org
A Markov Decision Process (MDP) policy presents, for each state, an action, which
preferably maximizes the expected utility accrual over time. In this article, we present a novel …

MyPDDL: Tools for efficiently creating PDDL domains and problems

V Strobel, A Kirsch - Knowledge Engineering Tools and Techniques for AI …, 2020 - Springer
Abstract The Planning Domain Definition Language (PDDL) is the state-of-the-art language
for specifying planning problems in artificial intelligence research. Writing and maintaining …

Oga-uct: On-the-go abstractions in uct

A Anand, R Noothigattu, P Singla - Proceedings of the International …, 2016 - ojs.aaai.org
Recent work has begun exploring the value of domain abstractions in Monte-Carlo Tree
Search (MCTS) algorithms for probabilistic planning. These algorithms automatically …

Sample-based tree search with fixed and adaptive state abstractions

J Hostetler, A Fern, T Dietterich - Journal of Artificial Intelligence Research, 2017 - jair.org
Sample-based tree search (SBTS) is an approach to solving Markov decision problems
based on constructing a lookahead search tree using random samples from a generative …

JaxPlan and GurobiPlan: Optimization Baselines for Replanning in Discrete and Mixed Discrete-Continuous Probabilistic Domains

M Gimelfarb, A Taitler, S Sanner - Proceedings of the International …, 2024 - ojs.aaai.org
Replanning methods that determinize a stochastic planning problem and replan at each
action step have long been known to provide strong baseline (and even competition …

Generalised task planning with first-order function approximation

JHA Ng, RPA Petrick - Conference on Robot Learning, 2022 - proceedings.mlr.press
Real world robotics often operates in uncertain and dynamic environments where
generalisation over different scenarios is of practical interest. In the absence of a model …

A Decision Framework for AR, Dialogue and Eye Gaze to Enhance Human-Robot Collaboration

C Zou, Y Ding, K Chandan, S Zhang - Companion of the 2024 ACM/IEEE …, 2024 - dl.acm.org
Enabling an intuitive, bidirectional communication with real-time feedback to convey
intentions and goals is essential in human-robot collaboration (HRC). In this paper, we …

Decreasing uncertainty in planning with state prediction

S Krivic, M Cashmore, D Magazzeni, B Ridder… - 2017 - strathprints.strath.ac.uk
In real world environments the state is almost never completely known. Exploration is often
expensive. The application of planning in these environments is consequently more difficult …