The classical ski-rental problem admits a textbook 2-competitive deterministic algorithm, and a simple randomized algorithm that is e/e-1-competitive in expectation. The randomized …
We study the survival bandit problem, a variant of the multi-armed bandit problem with a constraint on the cumulative reward; at each time step, the agent receives a reward in [-1, 1] …
In this paper we consider a particular class of problems called multiarmed gambler bandits (MAGB) which constitutes a modified version of the Bernoulli MAB problem where two new …
N Manome, S Shinohara, U Chung - arXiv preprint arXiv:2308.14350, 2023 - arxiv.org
The multi-armed bandit (MAB) problem is a classical problem that models sequential decision-making under uncertainty in reinforcement learning. In this study, we propose a …
FS Perotto, M Nargeot, A Ouahbi - International Conference on …, 2024 - Springer
Abstract Survival Reinforcement Learning is a specific type of RL problem constrained by a risk of ruin. The underlying stochastic sequential decision process with which the agent …
X Wang, H Xie, P Wang, JCS Lui - Performance Evaluation, 2023 - Elsevier
User abandonment behaviors are quite common in recommendation applications such as online shopping recommendation and news recommendation. To maximize its total “reward” …
P Veroutis, F Godin - arXiv preprint arXiv:2410.16486, 2024 - arxiv.org
The Multiarmed Bandits (MAB) problem has been extensively studied and has seen many practical applications in a variety of fields. The Survival Multiarmed Bandits (S-MAB) open …
In this paper we consider Multi-Armed Gambler Bandits (MAGB), a stochastic random process in which an agent performs successive actions and either loses 1 unit from its …
In this paper we consider a particular class of problems called multiarmed gambler bandits (MAGB) which constitutes a modified version of the Bernoulli MAB problem where two new …