RC Broek, R Litjens, T Sagis, N Verbeeke… - … Symposium on Intelligent …, 2024 - Springer
Decision-making problems of sequential nature, where decisions made in the past may have an impact on the future, are used to model many practically important applications. In …
RC Broek, R Litjens, T Sagis, L Siecker… - arXiv preprint arXiv …, 2022 - arxiv.org
We investigate the Multi-Armed Bandit problem with Temporally-Partitioned Rewards (TP- MAB) setting in this paper. In the TP-MAB setting, an agent will receive subsets of the reward …
This thesis revolves around the problem of selling and advertising products on the Web and exploits techniques from the fields of algorithmic game theory, mechanism design, and …
This work pertains to the field of Multi-Armed-Bandits (MAB), a framework in online learning where an agent sequentially chooses from a set of available actions, called arms, and …