Fast swap regret minimization and applications to approximate correlated equilibria

B Peng, A Rubinstein - Proceedings of the 56th Annual ACM Symposium …, 2024 - dl.acm.org
We give a simple and computationally efficient algorithm that, for any constant ε> 0, obtains ε
T-swap regret within only T=(n) rounds; this is an exponential improvement compared to the …

Contracting with a learning agent

G Guruganesh, Y Kolumbus, J Schneider… - arXiv preprint arXiv …, 2024 - arxiv.org
Many real-life contractual relations differ completely from the clean, static model at the heart
of principal-agent theory. Typically, they involve repeated strategic interactions of the …

Efficient prior-free mechanisms for no-regret agents

N Collina, A Roth, H Shao - Proceedings of the 25th ACM Conference on …, 2024 - dl.acm.org
We study a repeated Principal Agent problem between a long lived Principal and Agent pair
in a prior free setting. In our setting, the sequence of realized states of nature may be …

Pareto-optimal algorithms for learning in games

ER Arunachaleswaran, N Collina… - Proceedings of the 25th …, 2024 - dl.acm.org
We study the problem of characterizing optimal learning algorithms for playing repeated
games against an adversary with unknown payoffs. In this problem, the first player (called …

Strategically-robust learning algorithms for bidding in first-price auctions

R Kumar, J Schneider, B Sivan - arXiv preprint arXiv:2402.07363, 2024 - arxiv.org
Learning to bid in repeated first-price auctions is a fundamental problem at the interface of
game theory and machine learning, which has seen a recent surge in interest due to the …

Steering no-regret learners to optimal equilibria

BH Zhang, G Farina, I Anagnostides, F Cacciamani… - 2023 - openreview.net
We consider the problem of steering no-regret-learning agents to play desirable equilibria
via nonnegative payments. We show that steering is impossible if the total budget (across …

Maximizing utility in multi-agent environments by anticipating the behavior of other learners

A Assos, Y Dagan, C Daskalakis - arXiv preprint arXiv:2407.04889, 2024 - arxiv.org
Learning algorithms are often used to make decisions in sequential decision-making
environments. In multi-agent settings, the decisions of each agent can affect the …

Steering No-Regret Learners to a Desired Equilibrium

BH Zhang, G Farina, I Anagnostides… - arXiv preprint arXiv …, 2023 - arxiv.org
A mediator observes no-regret learners playing an extensive-form game repeatedly across $
T $ rounds. The mediator attempts to steer players toward some desirable predetermined …

Impact of Decentralized Learning on Player Utilities in Stackelberg Games

K Donahue, N Immorlica, M Jagadeesan… - arXiv preprint arXiv …, 2024 - arxiv.org
When deployed in the world, a learning agent such as a recommender system or a chatbot
often repeatedly interacts with another learning agent (such as a user) over time. In many …

Is Knowledge Power? On the (Im) possibility of Learning from Strategic Interaction

N Ananthakrishnan, N Haghtalab, C Podimata… - arXiv preprint arXiv …, 2024 - arxiv.org
When learning in strategic environments, a key question is whether agents can overcome
uncertainty about their preferences to achieve outcomes they could have achieved absent …