S Bhatt, G Fang, P Li - International Conference on Artificial …, 2023 - proceedings.mlr.press
Piecewise stationary stochastic multi-armed bandits have been extensively explored in the risk-neutral and sub-Gaussian setting. In this work, we consider a multi-armed bandit …
This paper considers an empirical risk minimization problem under heavy-tailed settings, where data does not have finite variance, but only has $ p $-th moment with $ p\in (1, 2) …
Y Luo, C Gao - arXiv preprint arXiv:2410.22647, 2024 - arxiv.org
This paper studies the construction of adaptive confidence intervals under Huber's contamination model when the contamination proportion is unknown. For the robust …
Proposed in Hyv\" arinen (2005), score matching is a parameter estimation procedure that does not require computation of distributional normalizing constants. In this work we utilize …
X Zhou, W Zhang - The Thirty-eighth Annual Conference on Neural … - openreview.net
We study the interplay between local differential privacy (LDP) and robustness to Huber corruption and possibly heavy-tailed rewards in the context of multi-armed bandits (MABs) …