Differentiable bilevel programming for stackelberg congestion games

J Li, J Yu, Q Wang, B Liu, Z Wang, YM Nie - arXiv preprint arXiv …, 2022 - arxiv.org
A Stackelberg congestion game (SCG) is a bilevel program in which a leader aims to
maximize their own gain by anticipating and manipulating the equilibrium state at which …

Polyak–Łojasiewicz inequality on the space of measures and convergence of mean-field birth-death processes

L Liu, MB Majka, Ł Szpruch - Applied Mathematics & Optimization, 2023 - Springer
The Polyak–Łojasiewicz inequality (PŁI) in R d is a natural condition for proving
convergence of gradient descent algorithms (Karimi et al. in: Frasconi et al.(eds) Machine …

On centralized and distributed mirror descent: Convergence analysis using quadratic constraints

Y Sun, M Fazlyab… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Mirror descent (MD) is a powerful first-order optimization technique that subsumes several
optimization algorithms including gradient descent (GD). In this work, we leverage quadratic …

A fast adaptive online gradient descent algorithm in over-parameterized neural networks

A Yang, D Li, G Li - Neural Processing Letters, 2023 - Springer
In recent years, deep learning has dramatically improved state of the art in many practical
applications. However, this utility is highly dependent on fine-tuning of hyperparameters …

Configurable Mirror Descent: Towards a Unification of Decision Making

P Li, S Li, C Yang, X Wang, S Hu, X Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
Decision-making problems, categorized as single-agent, eg, Atari, cooperative multi-agent,
eg, Hanabi, competitive multi-agent, eg, Hold'em poker, and mixed cooperative and …

A Mirror Descent Perspective of Smoothed Sign Descent

S Wang, D Klabjan - arXiv preprint arXiv:2410.14158, 2024 - arxiv.org
Recent work by Woodworth et al.(2020) shows that the optimization dynamics of gradient
descent for overparameterized problems can be viewed as low-dimensional dual dynamics …

Reusing combinatorial structure: faster iterative projections over submodular base polytopes

J Moondra, H Mortagy, S Gupta - Advances in Neural …, 2021 - proceedings.neurips.cc
Optimization algorithms such as projected Newton's method, FISTA, mirror descent and its
variants enjoy near-optimal regret bounds and convergence rates, but suffer from a …

First-Order Algorithms for Optimization over Graph Laplacians

T Maunu - 2023 International Conference on Sampling Theory …, 2023 - ieeexplore.ieee.org
When solving an optimization problem over the set of graph Laplacian matrices, one must
deal with a large number of constraints as well as the large objective variable size. In this …

Characterizing linear convergence in optimization: Polyak-Łojasiewicz inequality and weak-quasi-strong-convexity

F Alimisis, B Vandereycken - openreview.net
We give a complete characterization of optimization problems that can be solved by gradient
descent with a linear convergence rate. We show that the well-known Polyak-Łojasiewicz …