Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org
This paper surveys the field of reinforcement learning from a computer-science perspective.
It is written to be accessible to researchers familiar with machine learning. Both the historical …

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

A taxonomy for task allocation problems with temporal and ordering constraints

E Nunes, M Manner, H Mitiche, M Gini - Robotics and Autonomous Systems, 2017 - Elsevier
Previous work on assigning tasks to robots has proposed extensive categorizations of
allocation of tasks with and without constraints. The main contribution of this paper is a …

[PDF][PDF] Planning, learning and coordination in multiagent decision processes

C Boutilier - TARK, 1996 - Citeseer
There has been a growing interest in AI in the design of multiagent systems, especially in
multiagent cooperative planning. In this paper, we investigate the extent to which methods …

[PDF][PDF] Acting optimally in partially observable stochastic domains

AR Cassandra, LP Kaelbling, ML Littman - Aaai, 1994 - cs.brown.edu
In this paper, we describe the partially observable Markov decision process (pomdp)
approach to nding optimal or near-optimal control strategies for partially observable …

On the complexity of solving Markov decision problems

ML Littman, TL Dean, LP Kaelbling - arXiv preprint arXiv:1302.4971, 2013 - arxiv.org
Markov decision problems (MDPs) provide the foundations for a number of problems of
interest to AI researchers studying automated planning and reinforcement learning. In this …

Efficient solution algorithms for factored MDPs

C Guestrin, D Koller, R Parr, S Venkataraman - Journal of Artificial …, 2003 - jair.org
This paper addresses the problem of planning under uncertainty in large Markov Decision
Processes (MDPs). Factored MDPs represent a complex state space using state variables …

Using temporal logics to express search control knowledge for planning

F Bacchus, F Kabanza - Artificial intelligence, 2000 - Elsevier
Over the years increasingly sophisticated planning algorithms have been developed. These
have made for more efficient planners, but unfortunately these planners still suffer from …

Stochastic dynamic programming with factored representations

C Boutilier, R Dearden, M Goldszmidt - Artificial intelligence, 2000 - Elsevier
Markov decision processes (MDPs) have proven to be popular models for decision-theoretic
planning, but standard dynamic programming algorithms for solving MDPs rely on explicit …

[PDF][PDF] Exploiting structure in policy construction

C Boutilier, R Dearden, M Goldszmidt - IJCAI, 1995 - cdn.aaai.org
Markov decision processes (MDPs) have recently been applied to the problem of modeling
decision-theoretic planning. While such traditional methods for solving MDPs are often …