Neural contextual bandits without regret

P Kassraie, A Krause - International Conference on Artificial …, 2022 - proceedings.mlr.press
Contextual bandits are a rich model for sequential decision making given side information,
with important applications, eg, in recommender systems. We propose novel algorithms for …

Model-based causal Bayesian optimization

S Sussex, A Makarova, A Krause - arXiv preprint arXiv:2211.10257, 2022 - arxiv.org
How should we intervene on an unknown structural equation model to maximize a
downstream variable of interest? This setting, also known as causal Bayesian optimization …

On the interplay between social welfare and tractability of equilibria

I Anagnostides, T Sandholm - Advances in Neural …, 2024 - proceedings.neurips.cc
Computational tractability and social welfare (aka. efficiency) of equilibria are two
fundamental but in general orthogonal considerations in algorithmic game theory …

Movement penalized Bayesian optimization with application to wind energy systems

SS Ramesh, PG Sessa, A Krause… - Advances in Neural …, 2022 - proceedings.neurips.cc
Contextual Bayesian optimization (CBO) is a powerful framework for sequential decision-
making given side information, with important applications, eg, in wind energy systems. In …

Three-operator splitting for learning to predict equilibria in convex games

D McKenzie, H Heaton, Q Li, S Wu Fung, S Osher… - SIAM Journal on …, 2024 - SIAM
Systems of competing agents can often be modeled as games. Assuming rationality, the
most likely outcomes are given by an equilibrium, eg, a Nash equilibrium. In many practical …

Reinforcement learning meets network intrusion detection: a transferable and adaptable framework for anomaly behavior identification

M He, X Wang, P Wei, L Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Anomaly detection plays an essential role in network security and traffic classification. Many
studies have focused on anomaly detection to improve network security, including machine …

[PDF][PDF] Learn to predict equilibria via fixed point networks

H Heaton, D McKenzie, Q Li, SW Fung… - arXiv preprint arXiv …, 2021 - researchgate.net
Abstract Systems of interacting agents can often be modeled as contextual games, where
the context encodes additional information, beyond the control of any agent (eg weather for …

Efficient model-based multi-agent reinforcement learning via optimistic equilibrium computation

PG Sessa, M Kamgarpour… - … Conference on Machine …, 2022 - proceedings.mlr.press
We consider model-based multi-agent reinforcement learning, where the environment
transition model is unknown and can only be learned via expensive interactions with the …

Are equivariant equilibrium approximators beneficial?

Z Duan, Y Ma, X Deng - International Conference on …, 2023 - proceedings.mlr.press
Recently, remarkable progress has been made by approximating Nash equilibrium (NE),
correlated equilibrium (CE), and coarse correlated equilibrium (CCE) through function …

Adversarial Causal Bayesian Optimization

S Sussex, PG Sessa, A Makarova… - The Twelfth International …, 2023 - openreview.net
In Causal Bayesian Optimization (CBO), an agent intervenes on a structural causal model
with known graph but unknown mechanisms to maximize a downstream reward variable. In …