Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc
There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

Deep reinforcement learning from self-play in imperfect-information games

J Heinrich, D Silver - arXiv preprint arXiv:1603.01121, 2016 - arxiv.org
Many real-world applications can be described as large-scale games of imperfect
information. To deal with these challenging domains, prior work has focused on computing …

Deep counterfactual regret minimization

N Brown, A Lerer, S Gross… - … conference on machine …, 2019 - proceedings.mlr.press
Abstract Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large
imperfect-information games. It converges to an equilibrium by iteratively traversing the …

Depth-limited solving for imperfect-information games

N Brown, T Sandholm, B Amos - Advances in neural …, 2018 - proceedings.neurips.cc
A fundamental challenge in imperfect-information games is that states do not have well-
defined values. As a result, depth-limited search algorithms used in single-agent settings …

Solving imperfect information games using decomposition

N Burch, M Johanson, M Bowling - … of the AAAI Conference on Artificial …, 2014 - ojs.aaai.org
Decomposition, ie independently analyzing possible subgames, has proven to be an
essential principle for effective decision-making in perfect information games. However, in …

Actor-critic policy optimization in a large-scale imperfect-information game

H Fu, W Liu, S Wu, Y Wang, T Yang, K Li… - International …, 2021 - openreview.net
The deep policy gradient method has demonstrated promising results in many large-scale
games, where the agent learns purely from its own experience. Yet, policy gradient methods …

[PDF][PDF] Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games.

V Lisý, M Lanctot, MH Bowling - AAMAS, 2015 - mlanctot.info
Online search in games has been a core interest of artificial intelligence. Search in imperfect
information games (eg, Poker, Bridge, Skat) is particularly challenging due to the …

[PDF][PDF] Hierarchical abstraction, distributed equilibrium computation, and post-processing, with application to a champion no-limit Texas Hold'em agent

N Brown, S Ganzfried, T Sandholm - Workshops at the twenty-ninth …, 2015 - cdn.aaai.org
The leading approach for solving large imperfect-information games is automated
abstraction followed by running an equilibrium-finding algorithm. We introduce a distributed …

Abstraction for solving large incomplete-information games

T Sandholm - Proceedings of the AAAI Conference on Artificial …, 2015 - ojs.aaai.org
Most real-world games and many recreational games are games of incomplete information.
Over the last dozen years, abstraction has emerged as a key enabler for solving large …