I Demirel,
MU Ozdemir, C Tekin - arXiv preprint arXiv:2112.06728, 2021 - arxiv.org
Multi-armed bandits (MAB) are extensively studied in various settings where the objective is
to\textit {maximize} the actions' outcomes (ie, rewards) over time. Since safety is crucial in …