Sample-based tree search with fixed and adaptive state abstractions

J Hostetler, A Fern, T Dietterich - Journal of Artificial Intelligence Research, 2017 - jair.org
Sample-based tree search (SBTS) is an approach to solving Markov decision problems
based on constructing a lookahead search tree using random samples from a generative …

State aggregation in Monte Carlo tree search

J Hostetler, A Fern, T Dietterich - … of the AAAI Conference on Artificial …, 2014 - ojs.aaai.org
Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-
making in Markov decision processes (MDPs). These algorithms can, however, perform …

Bootstrapping Monte Carlo tree search with an imperfect heuristic

THD Nguyen, WS Lee, TY Leong - … PKDD 2012, Bristol, UK, September 24 …, 2012 - Springer
We consider the problem of using a heuristic policy to improve the value approximation by
the Upper Confidence Bound applied in Trees (UCT) algorithm in non-adversarial settings …

An analysis of monte carlo tree search

S James, G Konidaris, B Rosman - … of the AAAI Conference on Artificial …, 2017 - ojs.aaai.org
Abstract Monte Carlo Tree Search (MCTS) is a family of directed search algorithms that has
gained widespread attention in recent years. Despite the vast amount of research into …

Bayesian optimized monte carlo planning

J Mern, A Yildiz, Z Sunberg, T Mukerji… - Proceedings of the …, 2021 - ojs.aaai.org
Online solvers for partially observable Markov decision processes have difficulty scaling to
problems with large action spaces. Monte Carlo tree search with progressive widening …

Confidence backup updates for aggregating mdp state values in monte-carlo tree search

Z Bnaya, A Palombo, R Puzis, A Felner - Proceedings of the …, 2015 - ojs.aaai.org
Abstract Monte-Carlo Tree Search (MCTS) algorithms estimate the value of MDP states
based on rewards received by performing multiple random simulations. MCTS algorithms …

[PDF][PDF] Active tree search

R Lieck, M Toussaint - ICAPS Workshop on Planning, Search …, 2017 - argmin.lis.tu-berlin.de
Monte-Carlo tree search is based on contiguous rollouts. Since not all samples within a
rollout necessarily provide relevant information, contiguous rollouts may be wasteful as …

Hierarchical monte-carlo planning

NA Vien, M Toussaint - Proceedings of the AAAI Conference on …, 2015 - ojs.aaai.org
Abstract Monte-Carlo Tree Search, especially UCT and its POMDP version POMCP, have
demonstrated excellent performanceon many problems. However, to efficiently scale to …

[PDF][PDF] Approaching Bayes-optimalilty using Monte-Carlo tree search

J Asmuth, ML Littman - Proc. 21st Int. Conf. Automat …, 2011 - icaps11.icaps-conference.org
Bayes-optimal behavior, while well-defined, is often difficult to achieve. Recent advances in
the use of Monte-Carlo tree search (MCTS) have shown that it is possible to act …

[PDF][PDF] Using Linear Programming for Bayesian Exploration in Markov Decision Processes.

PS Castro, D Precup - IJCAI, 2007 - academia.edu
A key problem in reinforcement learning is finding a good balance between the need to
explore the environment and the need to gain rewards by exploiting existing knowledge …