Z Xu, R Wang, A Ramdas - Advances in Neural Information …, 2021 - proceedings.neurips.cc
In bandit multiple hypothesis testing, each arm corresponds to a different null hypothesis that we wish to test, and the goal is to design adaptive algorithms that correctly identify large set …
We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must select a batch of arms, each characterized by its own context, that would allow it to both …
L Lin, M Ying, S Ghosh, K Khamaru… - Advances in Neural …, 2023 - proceedings.neurips.cc
Estimation and inference in statistics pose significant challenges when data are collected adaptively. Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to …
D Xiang, R West, J Wang, X Cui, J Huang - Proceedings of the 28th ACM …, 2022 - dl.acm.org
An emerging dilemma that faces practitioners in large scale online experimentation for e- commerce is whether to use Multi-Armed Bandit (MAB) algorithms for testing or traditional …
Entropy regularization is known to improve exploration in sequential decision-making problems. We show that this same mechanism can also lead to nearly unbiased and lower …
B Cho, K Gan, N Kallus - arXiv preprint arXiv:2402.06122, 2024 - arxiv.org
We propose a novel nonparametric sequential test for composite hypotheses for means of multiple data streams. Our proposed method,\emph {peeking with expectation-based …
J Li, D Simchi-Levi, Y Zhao - arXiv preprint arXiv:2410.05552, 2024 - arxiv.org
Given n experiment subjects with potentially heterogeneous covariates and two possible treatments, namely active treatment and control, this paper addresses the fundamental …
We explore the promises and challenges of employing sequential decision-making algorithms--such as bandits, reinforcement learning, and active learning--in law and public …
Optimal transport is a flexible framework for comparing probability distributions, which has received a recent surge of interest as a methodological tool in statistics. The aim of this …