On the bias, risk, and consistency of sample means in multi-armed bandits

T Manole, A Ramdas - IEEE Transactions on Information …, 2023 - ieeexplore.ieee.org

We present a unified technique for sequential estimation of convex divergences between
distributions, including integral probability metrics like the kernel maximum mean …

被引用次数：38 相关文章所有 6 个版本

[PDF] neurips.cc

A unified framework for bandit multiple testing

Z Xu, R Wang, A Ramdas - Advances in Neural Information …, 2021 - proceedings.neurips.cc

In bandit multiple hypothesis testing, each arm corresponds to a different null hypothesis that
we wish to test, and the goal is to design adaptive algorithms that correctly identify large set …

被引用次数：18 相关文章所有 9 个版本

[PDF] aaai.org

Integrating reward maximization and population estimation: Sequential decision-making for Internal Revenue Service audit selection

P Henderson, B Chugg, B Anderson… - Proceedings of the …, 2023 - ojs.aaai.org

We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must
select a batch of arms, each characterized by its own context, that would allow it to both …

被引用次数：10 相关文章所有 5 个版本

[PDF] neurips.cc

Statistical limits of adaptive linear models: low-dimensional estimation and inference

L Lin, M Ying, S Ghosh, K Khamaru… - Advances in Neural …, 2023 - proceedings.neurips.cc

Estimation and inference in statistics pose significant challenges when data are collected
adaptively. Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to …

被引用次数：2 相关文章所有 8 个版本

[PDF] archive.org

Multi armed bandit vs. a/b tests in e-commerce-confidence interval and hypothesis test power perspectives

D Xiang, R West, J Wang, X Cui, J Huang - Proceedings of the 28th ACM …, 2022 - dl.acm.org

An emerging dilemma that faces practitioners in large scale online experimentation for e-
commerce is whether to use Multi-Armed Bandit (MAB) algorithms for testing or traditional …

被引用次数：13 相关文章所有 2 个版本

[PDF] aaai.org

Entropy regularization for population estimation

B Chugg, P Henderson, J Goldin, DE Ho - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Entropy regularization is known to improve exploration in sequential decision-making
problems. We show that this same mechanism can also lead to nearly unbiased and lower …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams

B Cho, K Gan, N Kallus - arXiv preprint arXiv:2402.06122, 2024 - arxiv.org

We propose a novel nonparametric sequential test for composite hypotheses for means of
multiple data streams. Our proposed method,\emph {peeking with expectation-based …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Optimal Adaptive Experimental Design for Estimating Treatment Effect

J Li, D Simchi-Levi, Y Zhao - arXiv preprint arXiv:2410.05552, 2024 - arxiv.org

Given n experiment subjects with potentially heterogeneous covariates and two possible
treatments, namely active treatment and control, this paper addresses the fundamental …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Beyond ads: sequential decision-making algorithms in law and public policy

P Henderson, B Chugg, B Anderson… - Proceedings of the 2022 …, 2022 - dl.acm.org

We explore the promises and challenges of employing sequential decision-making
algorithms--such as bandits, reinforcement learning, and active learning--in law and public …

被引用次数：10 相关文章所有 4 个版本

[PDF] cmu.edu

[PDF][PDF] Statistical inference for optimal transport

TA Manole - 2024 - kilthub.cmu.edu

Optimal transport is a flexible framework for comparing probability distributions, which has
received a recent surge of interest as a methodological tool in statistics. The aim of this …

被引用次数：2 相关文章所有 3 个版本

高级搜索

QQ 群