Decision-theoretic planning: Structural assumptions and computational leverage

O Madani, S Hanks, A Condon - Artificial Intelligence, 2003 - Elsevier

Automated planning, the problem of how an agent achieves a goal given a repertoire of
actions, is one of the foundational and most widely studied problems in the AI literature. The …

被引用次数：339 相关文章所有 8 个版本

[PDF] siam.org

[图书][B] Adaptive treatment strategies in practice: planning trials and analyzing data for personalized medicine

MR Kosorok, EEM Moodie - 2015 - SIAM

The study of new medical treatments, and sequences of treatments, is inextricably linked
with statistics. Without statistical estimation and inference, we are left with case studies and …

被引用次数：169 相关文章所有 6 个版本

[PDF] kent.ac.uk

Reward shaping in episodic reinforcement learning

M Grzes - 2017 - kar.kent.ac.uk

Recent advancements in reinforcement learning confirm that reinforcement learning
techniques can solve large scale problems leading to high quality autonomous decision …

被引用次数：144 相关文章所有 11 个版本

[PDF] psu.edu

From decision theory to decision aiding methodology

A Tsoukiàs - European journal of operational research, 2008 - Elsevier

The paper presents the author's partial and personal historical reconstruction of how
decision theory is evolving to a decision aiding methodology. The presentation shows …

被引用次数：336 相关文章所有 18 个版本

[PDF] cmu.edu

[PDF][PDF] PPDDL1. 0: An extension to PDDL for expressing planning domains with probabilistic effects

HLS Younes, ML Littman - … . Rep. CMU-CS …, 2004 - reports-archive.adm.cs.cmu.edu

We desribe a variation of the planning domain definition language, PDDL, that permits the
modeling of probabilistic planning problems with rewards. This language, PPDDL1. 0, was …

被引用次数：318 相关文章所有 9 个版本

[PDF] researchgate.net

Game theory and decision theory in multi-agent systems

S Parsons, M Wooldridge - Autonomous Agents and Multi-Agent Systems, 2002 - Springer

In the last few years, there has been increasing interest from the agent community in the use
of techniques from decision theory and game theory. Our aims in this article are firstly to …

被引用次数：335 相关文章所有 13 个版本

[PDF] aaai.org

[PDF][PDF] Plan Stability: Replanning versus Plan Repair.

M Fox, A Gerevini, D Long, I Serina - ICAPS, 2006 - cdn.aaai.org

The ultimate objective in planning is to construct plans for execution. However, when a plan
is executed in a real environment it can encounter differences between the expected and …

被引用次数：341 相关文章所有 12 个版本

[PDF] psu.edu

An artificial intelligence perspective on autonomic computing policies

JO Kephart, WE Walsh - Proceedings. Fifth IEEE International …, 2004 - ieeexplore.ieee.org

We introduce a unified framework that interrelates three different types of policies that will be
used in autonomic computing system: action, goal, and utility function policies. Our policy …

被引用次数：445 相关文章所有 7 个版本

[PDF] psu.edu

What you should know about approximate dynamic programming

WB Powell - Naval Research Logistics (NRL), 2009 - Wiley Online Library

Approximate dynamic programming (ADP) is a broad umbrella for a modeling and
algorithmic strategy for solving problems that are sometimes large and complex, and are …

被引用次数：259 相关文章所有 6 个版本

[PDF] hal.science

Complex system reliability modelling with dynamic object oriented Bayesian networks (DOOBN)

P Weber, L Jouffe - Reliability Engineering & System Safety, 2006 - Elsevier

Nowadays, the complex manufacturing processes have to be dynamically modelled and
controlled to optimise the diagnosis and the maintenance policies. This article presents a …

被引用次数：356 相关文章所有 15 个版本

高级搜索

QQ 群