Fast policy extragradient methods for competitive games with entropy regularization

S Cen, Y Wei, Y Chi - Advances in Neural Information …, 2021 - proceedings.neurips.cc
This paper investigates the problem of computing the equilibrium of competitive games,
which is often modeled as a constrained saddle-point optimization problem with probability …

Maximum-entropy multi-agent dynamic games: Forward and inverse solutions

N Mehr, M Wang, M Bhatt… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
In this article, we study the problem of multiple stochastic agents interacting in a dynamic
game scenario with continuous state and action spaces. We define a new notion of …

Exploratory LQG mean field games with entropy regularization

D Firoozi, S Jaimungal - Automatica, 2022 - Elsevier
We study a general class of entropy-regularized multi-variate LQG mean field game (MFG)
systems in continuous time with K distinct subpopulations of agents. We extend the notion of …

Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games

Z Sun, G Jia - Applied Mathematics and Computation, 2023 - Elsevier
In this paper, we study an entropy-regularized continuous-time linear-quadratic two-person
zero-sum stochastic differential game problem from the perspective of reinforcement …

Fast computation of optimal transport via entropy-regularized extragradient methods

G Li, Y Chen, Y Huang, Y Chi, HV Poor… - arXiv preprint arXiv …, 2023 - arxiv.org
Efficient computation of the optimal transport distance between two distributions serves as
an algorithm subroutine that empowers various applications. This paper develops a scalable …

Exploratory control with Tsallis entropy for latent factor models

R Donnelly, S Jaimungal - SIAM Journal on Financial Mathematics, 2024 - SIAM
We study optimal control in models with latent factors where the agent controls the
distribution over actions, rather than actions themselves, in both discrete and continuous …

A Coupled Optimization Framework for Correlated Equilibria in Normal-Form Game

SHQ Li, Y Yu, F Dörfler, J Lygeros - arXiv preprint arXiv:2403.16223, 2024 - arxiv.org
In competitive multi-player interactions, simultaneous optimality is a key requirement for
establishing strategic equilibria. This property is explicit when the game-theoretic …

Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games

RJ Qin, FM Luo, H Qian, Y Yu - arXiv preprint arXiv:2208.09452, 2022 - arxiv.org
This paper addresses policy learning in non-stationary environments and games with
continuous actions. Rather than the classical reward maximization mechanism, inspired by …

Algorithmic Foundations of Policy Optimization in Reinforcement Learning, Multi-agent Systems, and AI Alignment

S Cen - 2024 - search.proquest.com
Reinforcement learning (RL) aims to solve various tasks by modeling them as learning and
sequential decision-making problems within an unknown environment. The empirical …

Convergence Analysis of Minimax Optimization and Multiagent Reinforcement Learning

Z Chen - 2023 - search.proquest.com
This dissertation investigates two popular machine learning frameworks, namely, minimax
optimization and multiagent reinforcement learning (MARL). There are a large number of …