相关文章- 学术资源搜索

Sample-based tree search with fixed and adaptive state abstractions

J Hostetler, A Fern, T Dietterich - Journal of Artificial Intelligence Research, 2017 - jair.org

Sample-based tree search (SBTS) is an approach to solving Markov decision problems
based on constructing a lookahead search tree using random samples from a generative …

被引用次数：13 相关文章所有 4 个版本

[PDF] aaai.org

State aggregation in Monte Carlo tree search

J Hostetler, A Fern, T Dietterich - … of the AAAI Conference on Artificial …, 2014 - ojs.aaai.org

Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-
making in Markov decision processes (MDPs). These algorithms can, however, perform …

被引用次数：49 相关文章所有 9 个版本

[PDF] arxiv.org

Bootstrapping Monte Carlo tree search with an imperfect heuristic

THD Nguyen, WS Lee, TY Leong - … PKDD 2012, Bristol, UK, September 24 …, 2012 - Springer

We consider the problem of using a heuristic policy to improve the value approximation by
the Upper Confidence Bound applied in Trees (UCT) algorithm in non-adversarial settings …

被引用次数：7 相关文章所有 11 个版本

[PDF] aaai.org

An analysis of monte carlo tree search

S James, G Konidaris, B Rosman - … of the AAAI Conference on Artificial …, 2017 - ojs.aaai.org

Abstract Monte Carlo Tree Search (MCTS) is a family of directed search algorithms that has
gained widespread attention in recent years. Despite the vast amount of research into …

被引用次数：68 相关文章所有 18 个版本

[PDF] aaai.org

Bayesian optimized monte carlo planning

J Mern, A Yildiz, Z Sunberg, T Mukerji… - Proceedings of the …, 2021 - ojs.aaai.org

Online solvers for partially observable Markov decision processes have difficulty scaling to
problems with large action spaces. Monte Carlo tree search with progressive widening …

被引用次数：31 相关文章所有 5 个版本

[PDF] aaai.org

Confidence backup updates for aggregating mdp state values in monte-carlo tree search

Z Bnaya, A Palombo, R Puzis, A Felner - Proceedings of the …, 2015 - ojs.aaai.org

Abstract Monte-Carlo Tree Search (MCTS) algorithms estimate the value of MDP states
based on rewards received by performing multiple random simulations. MCTS algorithms …

被引用次数：4 相关文章所有 9 个版本

[PDF] tu-berlin.de

[PDF][PDF] Active tree search

R Lieck, M Toussaint - ICAPS Workshop on Planning, Search …, 2017 - argmin.lis.tu-berlin.de

Monte-Carlo tree search is based on contiguous rollouts. Since not all samples within a
rollout necessarily provide relevant information, contiguous rollouts may be wasteful as …

被引用次数：4 相关文章所有 6 个版本

[PDF] aaai.org

Hierarchical monte-carlo planning

NA Vien, M Toussaint - Proceedings of the AAAI Conference on …, 2015 - ojs.aaai.org

Abstract Monte-Carlo Tree Search, especially UCT and its POMDP version POMCP, have
demonstrated excellent performanceon many problems. However, to efficiently scale to …

被引用次数：58 相关文章所有 8 个版本

[PDF] icaps-conference.org

[PDF][PDF] Approaching Bayes-optimalilty using Monte-Carlo tree search

J Asmuth, ML Littman - Proc. 21st Int. Conf. Automat …, 2011 - icaps11.icaps-conference.org

Bayes-optimal behavior, while well-defined, is often difficult to achieve. Recent advances in
the use of Monte-Carlo tree search (MCTS) have shown that it is possible to act …

被引用次数：30 相关文章所有 3 个版本

[PDF] academia.edu

[PDF][PDF] Using Linear Programming for Bayesian Exploration in Markov Decision Processes.

PS Castro, D Precup - IJCAI, 2007 - academia.edu

A key problem in reinforcement learning is finding a good balance between the need to
explore the environment and the need to gain rewards by exploiting existing knowledge …

被引用次数：34 相关文章所有 7 个版本

高级搜索

QQ 群

Sample-based tree search with fixed and adaptive state abstractions

State aggregation in Monte Carlo tree search

Bootstrapping Monte Carlo tree search with an imperfect heuristic

An analysis of monte carlo tree search

Bayesian optimized monte carlo planning

Confidence backup updates for aggregating mdp state values in monte-carlo tree search

[PDF][PDF] Active tree search

Hierarchical monte-carlo planning

[PDF][PDF] Approaching Bayes-optimalilty using Monte-Carlo tree search

[PDF][PDF] Using Linear Programming for Bayesian Exploration in Markov Decision Processes.

引用