Planning With Deadlines in Stochastic Domains.

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org

This paper surveys the field of reinforcement learning from a computer-science perspective.
It is written to be accessible to researchers familiar with machine learning. Both the historical …

被引用次数：11706 相关文章所有 77 个版本

[PDF] jair.org

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org

Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

被引用次数：1578 相关文章所有 27 个版本

[PDF] sciencedirect.com

A taxonomy for task allocation problems with temporal and ordering constraints

E Nunes, M Manner, H Mitiche, M Gini - Robotics and Autonomous Systems, 2017 - Elsevier

Previous work on assigning tasks to robots has proposed extensive categorizations of
allocation of tasks with and without constraints. The main contribution of this paper is a …

被引用次数：259 相关文章所有 10 个版本

[PDF] psu.edu

[PDF][PDF] Planning, learning and coordination in multiagent decision processes

C Boutilier - TARK, 1996 - Citeseer

There has been a growing interest in AI in the design of multiagent systems, especially in
multiagent cooperative planning. In this paper, we investigate the extent to which methods …

被引用次数：731 相关文章所有 18 个版本

[PDF] brown.edu

[PDF][PDF] Acting optimally in partially observable stochastic domains

AR Cassandra, LP Kaelbling, ML Littman - Aaai, 1994 - cs.brown.edu

In this paper, we describe the partially observable Markov decision process (pomdp)
approach to nding optimal or near-optimal control strategies for partially observable …

被引用次数：1012 相关文章所有 18 个版本

[PDF] arxiv.org

On the complexity of solving Markov decision problems

ML Littman, TL Dean, LP Kaelbling - arXiv preprint arXiv:1302.4971, 2013 - arxiv.org

Markov decision problems (MDPs) provide the foundations for a number of problems of
interest to AI researchers studying automated planning and reinforcement learning. In this …

被引用次数：743 相关文章所有 18 个版本

[PDF] jair.org

Efficient solution algorithms for factored MDPs

C Guestrin, D Koller, R Parr, S Venkataraman - Journal of Artificial …, 2003 - jair.org

This paper addresses the problem of planning under uncertainty in large Markov Decision
Processes (MDPs). Factored MDPs represent a complex state space using state variables …

被引用次数：678 相关文章所有 31 个版本

[PDF] sciencedirect.com

Using temporal logics to express search control knowledge for planning

F Bacchus, F Kabanza - Artificial intelligence, 2000 - Elsevier

Over the years increasingly sophisticated planning algorithms have been developed. These
have made for more efficient planners, but unfortunately these planners still suffer from …

被引用次数：820 相关文章所有 16 个版本

[PDF] sciencedirect.com

Stochastic dynamic programming with factored representations

C Boutilier, R Dearden, M Goldszmidt - Artificial intelligence, 2000 - Elsevier

Markov decision processes (MDPs) have proven to be popular models for decision-theoretic
planning, but standard dynamic programming algorithms for solving MDPs rely on explicit …

被引用次数：631 相关文章所有 21 个版本

[PDF] aaai.org

[PDF][PDF] Exploiting structure in policy construction

C Boutilier, R Dearden, M Goldszmidt - IJCAI, 1995 - cdn.aaai.org

Markov decision processes (MDPs) have recently been applied to the problem of modeling
decision-theoretic planning. While such traditional methods for solving MDPs are often …

被引用次数：574 相关文章所有 18 个版本

高级搜索

QQ 群