Evolutionary computation for reinforcement learning

S Whiteson - Reinforcement Learning: State-of-the-art, 2012 - Springer
Algorithms for evolutionary computation, which simulate the process of natural selection to
solve optimization problems, are an effective tool for discovering high-performing …

Multistage stochastic programming: A scenario tree based approach to planning under uncertainty

B Defourny, D Ernst, L Wehenkel - Decision theory models for …, 2012 - igi-global.com
In this chapter, we present the multistage stochastic programming framework for sequential
decision making under uncertainty. We discuss its differences with Markov Decision …

Formalizing and enforcing purpose restrictions in privacy policies

MC Tschantz, A Datta, JM Wing - 2012 IEEE Symposium on …, 2012 - ieeexplore.ieee.org
Privacy policies often place restrictions on the purposes for which a governed entity may use
personal information. For example, regulations, such as the Health Insurance Portability and …

Lifted linear programming

M Mladenov, B Ahmadi… - Artificial Intelligence and …, 2012 - proceedings.mlr.press
Lifted inference approaches have rendered large, previously intractable probabilistic
inference problems quickly solvable by handling whole sets of indistinguishable objects …

Worst-case analysis of strategy iteration and the simplex method

TD Hansen - 2012 - pure.au.dk
In this dissertation we study strategy iteration (also known as policy iteration) algorithms for
solving Markov decision processes (MDPs) and two-player turn-based stochastic games …

Online Planning for Large MDPs with MAXQ-like Decomposition

A Bai, F Wu, X Chen - … of the 8th International Conference on …, 2012 - staff.ustc.edu.cn
Markov decision processes (MDPs) provide an expressive framework for planning in
stochastic domains. However, exactly solving a large MDP is often intractable due to the …

On-line reinforcement learning using incremental kernel-based stochastic factorization

A Barreto, D Precup, J Pineau - Advances in Neural …, 2012 - proceedings.neurips.cc
The ability to learn a policy for a sequential decision problem with continuous state space
using on-line data is a long-standing challenge. This paper presents a new reinforcement …

Optimal intervention strategies for therapeutic methods with fixed-length duration of drug effectiveness

MR Yousefi, A Datta… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Intervention in gene regulatory networks in the context of Markov decision processes has
usually involved finding an optimal one-transition policy, where a decision is made at every …

Uav collision avoidance using a* algorithm

T Liao - 2012 - search.proquest.com
Collision avoidance is the essential requirement for unmanned aerial vehicles (UAVs) to
become fully autonomous. Several algorithms have been proposed to do the path planning …

Motion planning and stochastic control with experimental validation on a planetary rover

R McAllister, T Peynot, R Fitch… - 2012 IEEE/RSJ …, 2012 - ieeexplore.ieee.org
Motion planning for planetary rovers must consider control uncertainty in order to maintain
the safety of the platform during navigation. Modelling such control uncertainty is difficult due …