Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier
Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Federated bandit: A gossiping approach

Z Zhu, J Zhu, J Liu, Y Liu - Proceedings of the ACM on Measurement …, 2021 - dl.acm.org
In this paper, we study Federated Bandit, a decentralized Multi-Armed Bandit problem with a
set of N agents, who can only communicate their local data with neighbors described by a …

No peek: A survey of private distributed deep learning

P Vepakomma, T Swedish, R Raskar, O Gupta… - arXiv preprint arXiv …, 2018 - arxiv.org
We survey distributed deep learning models for training or inference without accessing raw
data from clients. These methods aim to protect confidential patterns in data while still …

Multi-armed bandits with local differential privacy

W Ren, X Zhou, J Liu, NB Shroff - arXiv preprint arXiv:2007.03121, 2020 - arxiv.org
This paper investigates the problem of regret minimization for multi-armed bandit (MAB)
problems with local differential privacy (LDP) guarantee. In stochastic bandit systems, the …

Differentially private Bayesian optimization

M Kusner, J Gardner, R Garnett… - … on machine learning, 2015 - proceedings.mlr.press
Bayesian optimization is a powerful tool for fine-tuning the hyper-parameters of a wide
variety of machine learning models. The success of machine learning has led practitioners …

Mitigating bias in adaptive data gathering via differential privacy

S Neel, A Roth - International Conference on Machine …, 2018 - proceedings.mlr.press
Data that is gathered adaptively—via bandit algorithms, for example—exhibits bias. This is
true both when gathering simple numeric valued data—the empirical means kept track of by …

[PDF][PDF] Differentially private, multi-agent multi-armed bandits

ACY Tossou, C Dimitrakakis - European Workshop on …, 2015 - academia.edu
Abstract 1 We study the problem of privacy for distributed learning in Multi-Armed bandit
(MAB) problem with multiple players. The players must co-ordinate, as choosing the same …

Active learning in contextual bandits: handling the uncertainty about the user's preferences in interactive recommendation systems

N de Campos Silva - 2023 - repositorio.ufmg.br
Abstract Nowadays, Recommendation Systems (RSs) have been concerned about the
online environment of real-world applications where the system should continually learn and …