Stable-predictive optimistic counterfactual regret minimization

N Brown, A Lerer, S Gross… - … conference on machine …, 2019 - proceedings.mlr.press

Abstract Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large
imperfect-information games. It converges to an equilibrium by iteratively traversing the …

被引用次数：281 相关文章所有 7 个版本

[PDF] neurips.cc

Last-iterate convergence of optimistic gradient method for monotone variational inequalities

E Gorbunov, A Taylor, G Gidel - Advances in neural …, 2022 - proceedings.neurips.cc

Abstract The Past Extragradient (PEG)[Popov, 1980] method, also known as the Optimistic
Gradient method, has known a recent gain in interest in the optimization community with the …

被引用次数：51 相关文章所有 9 个版本

[PDF] mlr.press

Convergence of proximal point and extragradient-based methods beyond monotonicity: the case of negative comonotonicity

E Gorbunov, A Taylor, S Horváth… - … on Machine Learning, 2023 - proceedings.mlr.press

Algorithms for min-max optimization and variational inequalities are often studied under
monotonicity assumptions. Motivated by non-monotone machine learning applications, we …

被引用次数：25 相关文章所有 13 个版本

[PDF] neurips.cc

Uncoupled Learning Dynamics with Swap Regret in Multiplayer Games

I Anagnostides, G Farina, C Kroer… - Advances in …, 2022 - proceedings.neurips.cc

In this paper we establish efficient and\emph {uncoupled} learning dynamics so that, when
employed by all players in a general-sum multiplayer game, the\emph {swap regret} of each …

被引用次数：34 相关文章所有 9 个版本

[PDF] arxiv.org

Suspicion-agent: Playing imperfect information games with theory of mind aware gpt-4

J Guo, B Yang, P Yoo, BY Lin, Y Iwasawa… - arXiv preprint arXiv …, 2023 - arxiv.org

Unlike perfect information games, where all elements are known to every player, imperfect
information games emulate the real-world complexities of decision-making under uncertain …

被引用次数：28 相关文章所有 3 个版本

[PDF] jair.org Full View

A survey of opponent modeling in adversarial domains

S Nashed, S Zilberstein - Journal of Artificial Intelligence Research, 2022 - jair.org

Opponent modeling is the ability to use prior knowledge and observations in order to predict
the behavior of an opponent. This survey presents a comprehensive overview of existing …

被引用次数：43 相关文章所有 9 个版本

[PDF] mlr.press

Kernelized multiplicative weights for 0/1-polyhedral games: Bridging the gap between learning in extensive-form and normal-form games

G Farina, CW Lee, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press

While extensive-form games (EFGs) can be converted into normal-form games (NFGs),
doing so comes at the cost of an exponential blowup of the strategy space. So, progress on …

被引用次数：32 相关文章所有 4 个版本

[PDF] neurips.cc

Last-iterate convergence in extensive-form games

CW Lee, C Kroer, H Luo - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Regret-based algorithms are highly efficient at finding approximate Nash equilibria in
sequential games such as poker games. However, most regret-based algorithms, including …

被引用次数：47 相关文章所有 6 个版本

[PDF] aaai.org

Faster game solving via predictive blackwell approachability: Connecting regret matching and mirror descent

G Farina, C Kroer, T Sandholm - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org

Blackwell approachability is a framework for reasoning about repeated games with vector-
valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the …

被引用次数：76 相关文章所有 8 个版本

[PDF] neurips.cc

Block-coordinate methods and restarting for solving extensive-form games

D Chakrabarti, J Diakonikolas… - Advances in Neural …, 2023 - proceedings.neurips.cc

Coordinate descent methods are popular in machine learning and optimization for their
simple sparse updates and excellent practical performance. In the context of large-scale …

被引用次数：5 相关文章所有 5 个版本

高级搜索

QQ 群