On the complexity of solving Markov decision problems

S Whiteson - Reinforcement Learning: State-of-the-art, 2012 - Springer

Algorithms for evolutionary computation, which simulate the process of natural selection to
solve optimization problems, are an effective tool for discovering high-performing …

被引用次数：76 相关文章所有 12 个版本

[PDF] uliege.be

Multistage stochastic programming: A scenario tree based approach to planning under uncertainty

B Defourny, D Ernst, L Wehenkel - Decision theory models for …, 2012 - igi-global.com

In this chapter, we present the multistage stochastic programming framework for sequential
decision making under uncertainty. We discuss its differences with Markov Decision …

被引用次数：100 相关文章所有 10 个版本

[PDF] researchgate.net

Formalizing and enforcing purpose restrictions in privacy policies

MC Tschantz, A Datta, JM Wing - 2012 IEEE Symposium on …, 2012 - ieeexplore.ieee.org

Privacy policies often place restrictions on the purposes for which a governed entity may use
personal information. For example, regulations, such as the Health Insurance Portability and …

被引用次数：60 相关文章所有 22 个版本

[PDF] mlr.press

Lifted linear programming

M Mladenov, B Ahmadi… - Artificial Intelligence and …, 2012 - proceedings.mlr.press

Lifted inference approaches have rendered large, previously intractable probabilistic
inference problems quickly solvable by handling whole sets of indistinguishable objects …

被引用次数：53 相关文章所有 13 个版本

[PDF] au.dk

Worst-case analysis of strategy iteration and the simplex method

TD Hansen - 2012 - pure.au.dk

In this dissertation we study strategy iteration (also known as policy iteration) algorithms for
solving Markov decision processes (MDPs) and two-player turn-based stochastic games …

被引用次数：31 相关文章所有 3 个版本

[PDF] ustc.edu.cn

Online Planning for Large MDPs with MAXQ-like Decomposition

A Bai, F Wu, X Chen - … of the 8th International Conference on …, 2012 - staff.ustc.edu.cn

Markov decision processes (MDPs) provide an expressive framework for planning in
stochastic domains. However, exactly solving a large MDP is often intractable due to the …

被引用次数：32 相关文章所有 14 个版本

[PDF] neurips.cc

On-line reinforcement learning using incremental kernel-based stochastic factorization

A Barreto, D Precup, J Pineau - Advances in Neural …, 2012 - proceedings.neurips.cc

The ability to learn a policy for a sequential decision problem with continuous state space
using on-line data is a long-standing challenge. This paper presents a new reinforcement …

被引用次数：20 相关文章所有 11 个版本

[PDF] researchgate.net

Optimal intervention strategies for therapeutic methods with fixed-length duration of drug effectiveness

MR Yousefi, A Datta… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org

Intervention in gene regulatory networks in the context of Markov decision processes has
usually involved finding an optimal one-transition policy, where a decision is made at every …

被引用次数：25 相关文章所有 10 个版本

[PDF] auburn.edu

Uav collision avoidance using a* algorithm

T Liao - 2012 - search.proquest.com

Collision avoidance is the essential requirement for unmanned aerial vehicles (UAVs) to
become fully autonomous. Several algorithms have been proposed to do the path planning …

被引用次数：22 相关文章所有 2 个版本

[PDF] qut.edu.au

Motion planning and stochastic control with experimental validation on a planetary rover

R McAllister, T Peynot, R Fitch… - 2012 IEEE/RSJ …, 2012 - ieeexplore.ieee.org

Motion planning for planetary rovers must consider control uncertainty in order to maintain
the safety of the platform during navigation. Modelling such control uncertainty is difficult due …

被引用次数：19 相关文章所有 11 个版本

高级搜索

QQ 群