Almost optimal batch-regret tradeoff for batch linear contextual bandits

Efficient batched algorithm for contextual linear bandits with large action space via soft elimination

O Hanna, L Yang, C Fragouli - Advances in Neural …, 2024 - proceedings.neurips.cc

In this paper, we provide the first efficient batched algorithm for contextual linear bandits with
large action spaces. Unlike existing batched algorithms that rely on action elimination, which …

Cited by 7 · Related articles · All 6 versions

[PDF] mlr.press

Contexts can be cheap: Solving stochastic contextual bandits with linear bandit algorithms

OA Hanna, L Yang, C Fragouli - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press

In this paper, we address the stochastic contextual linear bandit problem, where a decision
maker is provided a context (a random set of actions drawn from a distribution). The …

Cited by 14 · Related articles · All 5 versions

[PDF] mlr.press

CO-BED: information-theoretic contextual optimization via Bayesian experimental design

DR Ivanova, J Jennings, T Rainforth… - International …, 2023 - proceedings.mlr.press

We formalize the problem of contextual optimization through the lens of Bayesian
experimental design and propose CO-BED—a general, model-agnostic framework for …

Cited by 5 · Related articles · All 6 versions

[PDF] wiley.com Full View

Cited by 2 · Related articles · All 3 versions

[PDF] escholarship.org

Communication and Computationally Efficient Learning Algorithms

OAH Habib - 2024 - search.proquest.com

The growing availability of data and rapid advancements in machine learning are
revolutionizing decision-making. Often, these data come from distributed devices with low …
