Evaluating state-space abstractions in extensive-form games

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer

This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

Cited by 111 Related articles All 8 versions

[PDF] neurips.cc

A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc

There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

Cited by 785 Related articles All 15 versions

[PDF] arxiv.org

Deep reinforcement learning from self-play in imperfect-information games

J Heinrich, D Silver - arXiv preprint arXiv:1603.01121, 2016 - arxiv.org

Many real-world applications can be described as large-scale games of imperfect
information. To deal with these challenging domains, prior work has focused on computing …

Cited by 517 Related articles All 6 versions

[PDF] mlr.press

Deep counterfactual regret minimization

N Brown, A Lerer, S Gross… - … conference on machine …, 2019 - proceedings.mlr.press

Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large
imperfect-information games. It converges to an equilibrium by iteratively traversing the …

Cited by 280 Related articles All 7 versions

[PDF] neurips.cc

Depth-limited solving for imperfect-information games

N Brown, T Sandholm, B Amos - Advances in neural …, 2018 - proceedings.neurips.cc

A fundamental challenge in imperfect-information games is that states do not have well-
defined values. As a result, depth-limited search algorithms used in single-agent settings …

Cited by 98 Related articles All 12 versions

[PDF] aaai.org

Solving imperfect information games using decomposition

N Burch, M Johanson, M Bowling - … of the AAAI Conference on Artificial …, 2014 - ojs.aaai.org

Decomposition, i.e., independently analyzing possible subgames, has proven to be an
essential principle for effective decision-making in perfect information games. However, in …

Cited by 122 Related articles All 16 versions

[PDF] openreview.net

Actor-critic policy optimization in a large-scale imperfect-information game

H Fu, W Liu, S Wu, Y Wang, T Yang, K Li… - International …, 2021 - openreview.net

The deep policy gradient method has demonstrated promising results in many large-scale
games, where the agent learns purely from its own experience. Yet, policy gradient methods …

Cited by 29 Related articles All 2 versions

[PDF] mlanctot.info

Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games

V Lisý, M Lanctot, MH Bowling - AAMAS, 2015 - mlanctot.info

Online search in games has been a core interest of artificial intelligence. Search in imperfect
information games (e.g., Poker, Bridge, Skat) is particularly challenging due to the …

Cited by 80 Related articles All 9 versions

[PDF] aaai.org

Hierarchical abstraction, distributed equilibrium computation, and post-processing, with application to a champion no-limit Texas Hold'em agent

N Brown, S Ganzfried, T Sandholm - Workshops at the twenty-ninth …, 2015 - cdn.aaai.org

The leading approach for solving large imperfect-information games is automated
abstraction followed by running an equilibrium-finding algorithm. We introduce a distributed …

Cited by 87 Related articles All 9 versions

[PDF] aaai.org

Abstraction for solving large incomplete-information games

T Sandholm - Proceedings of the AAAI Conference on Artificial …, 2015 - ojs.aaai.org

Most real-world games and many recreational games are games of incomplete information.
Over the last dozen years, abstraction has emerged as a key enabler for solving large …

Cited by 73 Related articles All 8 versions
