Simple combinatorial algorithms for combinatorial bandits: Corruptions and approximations

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Simple combinatorial algorithms for combinatorial bandits: Corruptions and approximations

在引用文章中搜索

[PDF] mlr.press

Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

S Nakamura, M Sugiyama - International Conference on …, 2024 - proceedings.mlr.press

We study the real-valued combinatorial pure exploration of the multi-armed bandit in the
fixed-budget setting. We first introduce an algorithm named the Combinatorial Successive …

被引用次数：1 相关文章所有 3 个版本

Truthful Bandit Mechanisms for Repeated Two-stage Ad Auctions

H Li, Y Liu, Z Zheng, Z Zhang, J Xu, F Wu - Proceedings of the 30th ACM …, 2024 - dl.acm.org

Online advertising platforms leverage a two-stage auction architecture to deliver
personalized ads to users with low latency. The first stage efficiently selects a small subset of …

Distributed robust bandits with efficient communication

A Wang, Z Qin, L Zheng, D Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

The Distributed Multi-Armed Bandit (DMAB) is a powerful framework for studying many
network problems. The DMAB is typically studied in a paradigm, where signals activate each …

被引用次数：2 相关文章

[PDF] ccs-labs.org

[PDF][PDF] Robust Matroid Bandit Optimization against Adversarial Contamination

Y Tao, X Cheng, F Dressler, Z Cai, D Yu - ccs-labs.org

In this paper, we consider the matroid bandit optimization problem, a fundamental and
widely applicable framework for combinatorial multi-armed bandits where the action space …

高级搜索

QQ 群

Simple combinatorial algorithms for combinatorial bandits: Corruptions and approximations

Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

Truthful Bandit Mechanisms for Repeated Two-stage Ad Auctions

Distributed robust bandits with efficient communication

[PDF][PDF] Robust Matroid Bandit Optimization against Adversarial Contamination

引用