Linear bandits with memory: from rotting to rising

D Simchi-Levi, C Wang… - Advances in Neural …, 2023 - proceedings.neurips.cc

Experimentation has been critical and increasingly popular across various domains, such as
clinical trials and online platforms, due to its widely recognized benefits. One of the primary …

被引用次数：5 相关文章所有 3 个版本

[PDF] mlr.press

Revisiting weighted strategy for non-stationary parametric bandits

J Wang, P Zhao, ZH Zhou - International Conference on …, 2023 - proceedings.mlr.press

Non-stationary parametric bandits have attracted much attention recently. There are three
principled ways to deal with non-stationarity, including sliding-window, weighted, and restart …

被引用次数：7 相关文章所有 6 个版本

[PDF] openreview.net

Thompson sampling-like algorithms for stochastic rising rested bandits

M Fiandri, AM Metelli, F Trovò - Seventeenth European Workshop …, 2024 - openreview.net

Stochastic rising rested bandit (SRRB) is a specific bandit setting where the arms' expected
rewards increase as they are pulled. They model scenarios in which the performances of the …

被引用次数：1 相关文章

[PDF] arxiv.org

高级搜索

QQ 群