Deep counterfactual regret minimization

N Brown, A Lerer, S Gross… - … conference on machine …, 2019 - proceedings.mlr.press
Abstract Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large
imperfect-information games. It converges to an equilibrium by iteratively traversing the …

Last-iterate convergence of optimistic gradient method for monotone variational inequalities

E Gorbunov, A Taylor, G Gidel - Advances in neural …, 2022 - proceedings.neurips.cc
Abstract The Past Extragradient (PEG)[Popov, 1980] method, also known as the Optimistic
Gradient method, has known a recent gain in interest in the optimization community with the …

Convergence of proximal point and extragradient-based methods beyond monotonicity: the case of negative comonotonicity

E Gorbunov, A Taylor, S Horváth… - … on Machine Learning, 2023 - proceedings.mlr.press
Algorithms for min-max optimization and variational inequalities are often studied under
monotonicity assumptions. Motivated by non-monotone machine learning applications, we …

Uncoupled Learning Dynamics with Swap Regret in Multiplayer Games

I Anagnostides, G Farina, C Kroer… - Advances in …, 2022 - proceedings.neurips.cc
In this paper we establish efficient and\emph {uncoupled} learning dynamics so that, when
employed by all players in a general-sum multiplayer game, the\emph {swap regret} of each …

Suspicion-agent: Playing imperfect information games with theory of mind aware gpt-4

J Guo, B Yang, P Yoo, BY Lin, Y Iwasawa… - arXiv preprint arXiv …, 2023 - arxiv.org
Unlike perfect information games, where all elements are known to every player, imperfect
information games emulate the real-world complexities of decision-making under uncertain …

A survey of opponent modeling in adversarial domains

S Nashed, S Zilberstein - Journal of Artificial Intelligence Research, 2022 - jair.org
Opponent modeling is the ability to use prior knowledge and observations in order to predict
the behavior of an opponent. This survey presents a comprehensive overview of existing …

Kernelized multiplicative weights for 0/1-polyhedral games: Bridging the gap between learning in extensive-form and normal-form games

G Farina, CW Lee, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press
While extensive-form games (EFGs) can be converted into normal-form games (NFGs),
doing so comes at the cost of an exponential blowup of the strategy space. So, progress on …

Last-iterate convergence in extensive-form games

CW Lee, C Kroer, H Luo - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Regret-based algorithms are highly efficient at finding approximate Nash equilibria in
sequential games such as poker games. However, most regret-based algorithms, including …

Faster game solving via predictive blackwell approachability: Connecting regret matching and mirror descent

G Farina, C Kroer, T Sandholm - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Blackwell approachability is a framework for reasoning about repeated games with vector-
valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the …

Block-coordinate methods and restarting for solving extensive-form games

D Chakrabarti, J Diakonikolas… - Advances in Neural …, 2023 - proceedings.neurips.cc
Coordinate descent methods are popular in machine learning and optimization for their
simple sparse updates and excellent practical performance. In the context of large-scale …