Federated linear contextual bandits

R Huang, W Wu, J Yang… - Advances in neural …, 2021 - proceedings.neurips.cc
This paper presents a novel federated linear contextual bandits model, where individual
clients face different $ K $-armed stochastic bandits coupled through common global …

Federated multi-armed bandits with personalization

C Shi, C Shen, J Yang - International Conference on Artificial …, 2021 - proceedings.mlr.press
A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed,
which is a new bandit paradigm analogous to the federated learning (FL) framework in …

Federated multi-armed bandits

C Shi, C Shen - Proceedings of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated
learning (FL) framework in supervised learning. It is inspired by practical applications in …

Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization

C Shi, W Xiong, C Shen, J Yang - Advances in neural …, 2021 - proceedings.neurips.cc
Despite the significant interests and many progresses in decentralized multi-player multi-
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …

Multiplayer bandits without observing collision information

G Lugosi, A Mehrabian - Mathematics of Operations …, 2022 - pubsonline.informs.org
We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …

Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs

T Jin, HL Hsu, W Chang, P Xu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored
into overlapping groups. Each group represents a hyperedge, forming a hypergraph over …

Matching in multi-arm bandit with collision

Y Zhang, S Wang, Z Fang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
In this paper, we consider the matching of multi-agent multi-armed bandit problem, ie, while
agents prefer arms with higher expected reward, arms also have preferences on agents. In …

Incentivized communication for federated bandits

Z Wei, C Li, H Xu, H Wang - Advances in Neural …, 2023 - proceedings.neurips.cc
Most existing works on federated bandits take it for granted that all clients are altruistic about
sharing their data with the server for the collective good whenever needed. Despite their …

Competing for shareable arms in multi-player multi-armed bandits

R Xu, H Wang, X Zhang, B Li… - … Conference on Machine …, 2023 - proceedings.mlr.press
Competitions for shareable and limited resources have long been studied with strategic
agents. In reality, agents often have to learn and maximize the rewards of the resources at …

PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

R Xu, H Wang, X Zhang, B Li, P Cui - arXiv preprint arXiv:2403.15524, 2024 - arxiv.org
We introduce the Proportional Payoff Allocation Game (PPA-Game) to model how agents,
akin to content creators on platforms like YouTube and TikTok, compete for divisible …