Searching for solutions in games and artificial intelligence

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1309 相关文章所有 7 个版本

Computer‐assisted synthetic planning: the end of the beginning

S Szymkuć, EP Gajewska, T Klucznik… - Angewandte Chemie …, 2016 - Wiley Online Library

Exactly half a century has passed since the launch of the first documented research project
(1965 Dendral) on computer‐assisted organic synthesis. Many more programs were created …

被引用次数：528 相关文章所有 9 个版本

[PDF] arxiv.org

Mastering chess and shogi by self-play with a general reinforcement learning algorithm

D Silver, T Hubert, J Schrittwieser, I Antonoglou… - arXiv preprint arXiv …, 2017 - arxiv.org

The game of chess is the most widely-studied domain in the history of artificial intelligence.
The strongest programs are based on a combination of sophisticated search techniques …

被引用次数：2208 相关文章所有 20 个版本

[PDF] ucl.ac.uk

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai… - Science, 2018 - science.org

The game of chess is the longest-studied domain in the history of artificial intelligence. The
strongest programs are based on a combination of sophisticated search techniques, domain …

被引用次数：4391 相关文章所有 21 个版本

[PDF] arxiv.org

Deepstack: Expert-level artificial intelligence in heads-up no-limit poker

M Moravčík, M Schmid, N Burch, V Lisý, D Morrill… - Science, 2017 - science.org

Artificial intelligence has seen several breakthroughs in recent years, with games often
serving as milestones. A common feature of these games is that players have perfect …

被引用次数：1168 相关文章所有 22 个版本

[PDF] ualberta.ca

Heads-up limit hold'em poker is solved

M Bowling, N Burch, M Johanson, O Tammelin - Science, 2015 - science.org

Poker is a family of games that exhibit imperfect information, where players do not have full
knowledge of past events. Whereas many perfect-information games have been solved (eg …

被引用次数：587 相关文章所有 15 个版本

[PDF] academia.edu

Mastering the game of Go with deep neural networks and tree search

D Silver, A Huang, CJ Maddison, A Guez, L Sifre… - nature, 2016 - nature.com

The game of Go has long been viewed as the most challenging of classic games for artificial
intelligence owing to its enormous search space and the difficulty of evaluating board …

被引用次数：18959 相关文章所有 98 个版本

[PDF] hal.science

Efficient selectivity and backup operators in Monte-Carlo tree search

R Coulom - International conference on computers and games, 2006 - Springer

A Monte-Carlo evaluation consists in estimating a position by averaging the outcome of
several random continuations. The method can serve as an evaluation function at the leaves …

被引用次数：1978 相关文章所有 22 个版本

[PDF] mlr.press

Adversarial policies beat superhuman go AIs

TT Wang, A Gleave, T Tseng, K Pelrine… - International …, 2023 - proceedings.mlr.press

We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies
against it, achieving a $> $97% win rate against KataGo running at superhuman settings …

被引用次数：17 相关文章所有 9 个版本

[PDF] arxiv.org

Born to learn: the inspiration, progress, and future of evolved plastic artificial neural networks

A Soltoggio, KO Stanley, S Risi - Neural Networks, 2018 - Elsevier

Biological neural networks are systems of extraordinary computational capabilities shaped
by evolution, development, and lifelong learning. The interplay of these elements leads to …

被引用次数：159 相关文章所有 18 个版本

高级搜索

QQ 群