X Ren,
T Jin,
P Xu - arXiv preprint arXiv:2406.04137, 2024 - arxiv.org
We introduce the E $^ 4$ algorithm for the batched linear bandit problem, incorporating an
Explore-Estimate-Eliminate-Exploit framework. With a proper choice of exploration rate, we …