F Liu, N Shroff - International Conference on Machine …, 2019 - proceedings.mlr.press
Stochastic multi-armed bandits form a class of online learning problems that have important
applications in online recommendation systems, adaptive medical treatment, and many …