Private stochastic multi-arm bandits: From theory to practice

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier

Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

被引用次数：60 相关文章所有 2 个版本

[PDF] acm.org

Federated bandit: A gossiping approach

Z Zhu, J Zhu, J Liu, Y Liu - Proceedings of the ACM on Measurement …, 2021 - dl.acm.org

In this paper, we study Federated Bandit, a decentralized Multi-Armed Bandit problem with a
set of N agents, who can only communicate their local data with neighbors described by a …

被引用次数：86 相关文章所有 8 个版本

[PDF] arxiv.org

No peek: A survey of private distributed deep learning

P Vepakomma, T Swedish, R Raskar, O Gupta… - arXiv preprint arXiv …, 2018 - arxiv.org

We survey distributed deep learning models for training or inference without accessing raw
data from clients. These methods aim to protect confidential patterns in data while still …

被引用次数：97 相关文章所有 4 个版本

[PDF] arxiv.org

Multi-armed bandits with local differential privacy

W Ren, X Zhou, J Liu, NB Shroff - arXiv preprint arXiv:2007.03121, 2020 - arxiv.org

This paper investigates the problem of regret minimization for multi-armed bandit (MAB)
problems with local differential privacy (LDP) guarantee. In stochastic bandit systems, the …

被引用次数：49 相关文章所有 2 个版本

[PDF] mlr.press

Differentially private Bayesian optimization

M Kusner, J Gardner, R Garnett… - … on machine learning, 2015 - proceedings.mlr.press

Bayesian optimization is a powerful tool for fine-tuning the hyper-parameters of a wide
variety of machine learning models. The success of machine learning has led practitioners …

被引用次数：68 相关文章所有 13 个版本

[PDF] mlr.press

Mitigating bias in adaptive data gathering via differential privacy

S Neel, A Roth - International Conference on Machine …, 2018 - proceedings.mlr.press

Data that is gathered adaptively—via bandit algorithms, for example—exhibits bias. This is
true both when gathering simple numeric valued data—the empirical means kept track of by …

被引用次数：40 相关文章所有 5 个版本

[PDF] academia.edu

[PDF][PDF] Differentially private, multi-agent multi-armed bandits

ACY Tossou, C Dimitrakakis - European Workshop on …, 2015 - academia.edu

Abstract 1 We study the problem of privacy for distributed learning in Multi-Armed bandit
(MAB) problem with multiple players. The players must co-ordinate, as choosing the same …

被引用次数：10 相关文章所有 2 个版本

[PDF] ufmg.br

Active learning in contextual bandits: handling the uncertainty about the user's preferences in interactive recommendation systems

N de Campos Silva - 2023 - repositorio.ufmg.br

Abstract Nowadays, Recommendation Systems (RSs) have been concerned about the
online environment of real-world applications where the system should continually learn and …

高级搜索

QQ 群