On the undecidability of probabilistic planning and related stochastic optimization problems

O Madani, S Hanks, A Condon - Artificial Intelligence, 2003 - Elsevier
Automated planning, the problem of how an agent achieves a goal given a repertoire of
actions, is one of the foundational and most widely studied problems in the AI literature. The …

[图书][B] Adaptive treatment strategies in practice: planning trials and analyzing data for personalized medicine

MR Kosorok, EEM Moodie - 2015 - SIAM
The study of new medical treatments, and sequences of treatments, is inextricably linked
with statistics. Without statistical estimation and inference, we are left with case studies and …

Reward shaping in episodic reinforcement learning

M Grzes - 2017 - kar.kent.ac.uk
Recent advancements in reinforcement learning confirm that reinforcement learning
techniques can solve large scale problems leading to high quality autonomous decision …

From decision theory to decision aiding methodology

A Tsoukiàs - European journal of operational research, 2008 - Elsevier
The paper presents the author's partial and personal historical reconstruction of how
decision theory is evolving to a decision aiding methodology. The presentation shows …

[PDF][PDF] PPDDL1. 0: An extension to PDDL for expressing planning domains with probabilistic effects

HLS Younes, ML Littman - … . Rep. CMU-CS …, 2004 - reports-archive.adm.cs.cmu.edu
We desribe a variation of the planning domain definition language, PDDL, that permits the
modeling of probabilistic planning problems with rewards. This language, PPDDL1. 0, was …

Game theory and decision theory in multi-agent systems

S Parsons, M Wooldridge - Autonomous Agents and Multi-Agent Systems, 2002 - Springer
In the last few years, there has been increasing interest from the agent community in the use
of techniques from decision theory and game theory. Our aims in this article are firstly to …

[PDF][PDF] Plan Stability: Replanning versus Plan Repair.

M Fox, A Gerevini, D Long, I Serina - ICAPS, 2006 - cdn.aaai.org
The ultimate objective in planning is to construct plans for execution. However, when a plan
is executed in a real environment it can encounter differences between the expected and …

An artificial intelligence perspective on autonomic computing policies

JO Kephart, WE Walsh - Proceedings. Fifth IEEE International …, 2004 - ieeexplore.ieee.org
We introduce a unified framework that interrelates three different types of policies that will be
used in autonomic computing system: action, goal, and utility function policies. Our policy …

What you should know about approximate dynamic programming

WB Powell - Naval Research Logistics (NRL), 2009 - Wiley Online Library
Approximate dynamic programming (ADP) is a broad umbrella for a modeling and
algorithmic strategy for solving problems that are sometimes large and complex, and are …

Complex system reliability modelling with dynamic object oriented Bayesian networks (DOOBN)

P Weber, L Jouffe - Reliability Engineering & System Safety, 2006 - Elsevier
Nowadays, the complex manufacturing processes have to be dynamically modelled and
controlled to optimise the diagnosis and the maintenance policies. This article presents a …