Martingale methods for sequential estimation of convex functionals and divergences

T Manole, A Ramdas - IEEE Transactions on Information …, 2023 - ieeexplore.ieee.org
We present a unified technique for sequential estimation of convex divergences between
distributions, including integral probability metrics like the kernel maximum mean …

A unified framework for bandit multiple testing

Z Xu, R Wang, A Ramdas - Advances in Neural Information …, 2021 - proceedings.neurips.cc
In bandit multiple hypothesis testing, each arm corresponds to a different null hypothesis that
we wish to test, and the goal is to design adaptive algorithms that correctly identify large set …

Integrating reward maximization and population estimation: Sequential decision-making for Internal Revenue Service audit selection

P Henderson, B Chugg, B Anderson… - Proceedings of the …, 2023 - ojs.aaai.org
We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must
select a batch of arms, each characterized by its own context, that would allow it to both …

Statistical limits of adaptive linear models: low-dimensional estimation and inference

L Lin, M Ying, S Ghosh, K Khamaru… - Advances in Neural …, 2023 - proceedings.neurips.cc
Estimation and inference in statistics pose significant challenges when data are collected
adaptively. Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to …

Multi armed bandit vs. a/b tests in e-commerce-confidence interval and hypothesis test power perspectives

D Xiang, R West, J Wang, X Cui, J Huang - Proceedings of the 28th ACM …, 2022 - dl.acm.org
An emerging dilemma that faces practitioners in large scale online experimentation for e-
commerce is whether to use Multi-Armed Bandit (MAB) algorithms for testing or traditional …

Entropy regularization for population estimation

B Chugg, P Henderson, J Goldin, DE Ho - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Entropy regularization is known to improve exploration in sequential decision-making
problems. We show that this same mechanism can also lead to nearly unbiased and lower …

Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams

B Cho, K Gan, N Kallus - arXiv preprint arXiv:2402.06122, 2024 - arxiv.org
We propose a novel nonparametric sequential test for composite hypotheses for means of
multiple data streams. Our proposed method,\emph {peeking with expectation-based …

Optimal Adaptive Experimental Design for Estimating Treatment Effect

J Li, D Simchi-Levi, Y Zhao - arXiv preprint arXiv:2410.05552, 2024 - arxiv.org
Given n experiment subjects with potentially heterogeneous covariates and two possible
treatments, namely active treatment and control, this paper addresses the fundamental …

Beyond ads: sequential decision-making algorithms in law and public policy

P Henderson, B Chugg, B Anderson… - Proceedings of the 2022 …, 2022 - dl.acm.org
We explore the promises and challenges of employing sequential decision-making
algorithms--such as bandits, reinforcement learning, and active learning--in law and public …

[PDF][PDF] Statistical inference for optimal transport

TA Manole - 2024 - kilthub.cmu.edu
Optimal transport is a flexible framework for comparing probability distributions, which has
received a recent surge of interest as a methodological tool in statistics. The aim of this …