Decentralized multi-player multi-armed bandits with no collision information

R Huang, W Wu, J Yang… - Advances in neural …, 2021 - proceedings.neurips.cc

This paper presents a novel federated linear contextual bandits model, where individual
clients face different $ K $-armed stochastic bandits coupled through common global …

被引用次数：84 相关文章所有 9 个版本

[PDF] mlr.press

Federated multi-armed bandits with personalization

C Shi, C Shen, J Yang - International Conference on Artificial …, 2021 - proceedings.mlr.press

A general framework of personalized federated multi-armed bandits (PF-MAB) is proposed,
which is a new bandit paradigm analogous to the federated learning (FL) framework in …

被引用次数：95 相关文章所有 7 个版本

[PDF] aaai.org

Federated multi-armed bandits

C Shi, C Shen - Proceedings of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org

Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated
learning (FL) framework in supervised learning. It is inspired by practical applications in …

被引用次数：100 相关文章所有 8 个版本

[PDF] neurips.cc

Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization

C Shi, W Xiong, C Shen, J Yang - Advances in neural …, 2021 - proceedings.neurips.cc

Despite the significant interests and many progresses in decentralized multi-player multi-
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …

被引用次数：25 相关文章所有 9 个版本

[PDF] arxiv.org

Multiplayer bandits without observing collision information

G Lugosi, A Mehrabian - Mathematics of Operations …, 2022 - pubsonline.informs.org

We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …

被引用次数：55 相关文章所有 6 个版本

[PDF] aaai.org

Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs

T Jin, HL Hsu, W Chang, P Xu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored
into overlapping groups. Each group represents a hyperedge, forming a hypergraph over …

被引用次数：3 相关文章所有 4 个版本

[PDF] neurips.cc

Matching in multi-arm bandit with collision

Y Zhang, S Wang, Z Fang - Advances in Neural Information …, 2022 - proceedings.neurips.cc

In this paper, we consider the matching of multi-agent multi-armed bandit problem, ie, while
agents prefer arms with higher expected reward, arms also have preferences on agents. In …

被引用次数：14 相关文章所有 4 个版本

[PDF] neurips.cc

Incentivized communication for federated bandits

Z Wei, C Li, H Xu, H Wang - Advances in Neural …, 2023 - proceedings.neurips.cc

Most existing works on federated bandits take it for granted that all clients are altruistic about
sharing their data with the server for the collective good whenever needed. Despite their …

被引用次数：3 相关文章所有 5 个版本

[PDF] mlr.press

Competing for shareable arms in multi-player multi-armed bandits

R Xu, H Wang, X Zhang, B Li… - … Conference on Machine …, 2023 - proceedings.mlr.press

Competitions for shareable and limited resources have long been studied with strategic
agents. In reality, agents often have to learn and maximize the rewards of the resources at …

被引用次数：3 相关文章所有 7 个版本

[PDF] arxiv.org

PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

R Xu, H Wang, X Zhang, B Li, P Cui - arXiv preprint arXiv:2403.15524, 2024 - arxiv.org

We introduce the Proportional Payoff Allocation Game (PPA-Game) to model how agents,
akin to content creators on platforms like YouTube and TikTok, compete for divisible …

被引用次数：2 相关文章所有 2 个版本

高级搜索

QQ 群