Locally differentially private (contextual) bandits learning

D Yu, H Zhang, W Chen, J Yin… - … Conference on Machine …, 2021 - proceedings.mlr.press

We propose a reparametrization scheme to address the challenges of applying differentially
private SGD on large neural networks, which are 1) the huge memory cost of storing …

被引用次数：82 相关文章所有 5 个版本

[PDF] arxiv.org

Privacy-preserving dynamic personalized pricing with demand learning

X Chen, D Simchi-Levi, Y Wang - Management Science, 2022 - pubsonline.informs.org

The prevalence of e-commerce has made customers' detailed personal information readily
accessible to retailers, and this information has been widely used in pricing decisions. When …

被引用次数：73 相关文章所有 9 个版本

[PDF] neurips.cc

Optimal order simple regret for Gaussian process bandits

S Vakili, N Bouziani, S Jalali… - Advances in Neural …, 2021 - proceedings.neurips.cc

Consider the sequential optimization of a continuous, possibly non-convex, and expensive
to evaluate objective function $ f $. The problem can be cast as a Gaussian Process (GP) …

被引用次数：40 相关文章所有 7 个版本

[PDF] mlr.press

Federated linear contextual bandits with user-level differential privacy

R Huang, H Zhang, L Melis, M Shen… - International …, 2023 - proceedings.mlr.press

This paper studies federated linear contextual bandits under the notion of user-level
differential privacy (DP). We first introduce a unified federated bandits framework that can …

被引用次数：13 相关文章所有 8 个版本

[PDF] arxiv.org

Zeroth-order optimization meets human feedback: Provable learning via ranking oracles

Z Tang, D Rybin, TH Chang - arXiv preprint arXiv:2303.03751, 2023 - arxiv.org

In this study, we delve into an emerging optimization challenge involving a black-box
objective function that can only be gauged via a ranking oracle-a situation frequently …

被引用次数：14 相关文章所有 4 个版本

[PDF] neurips.cc

When privacy meets partial information: A refined analysis of differentially private bandits

A Azize, D Basu - Advances in Neural Information …, 2022 - proceedings.neurips.cc

We study the problem of multi-armed bandits with ε-global Differential Privacy (DP). First, we
prove the minimax and problem-dependent regret lower bounds for stochastic and linear …

被引用次数：16 相关文章所有 10 个版本

[PDF] arxiv.org

Differential privacy in personalized pricing with nonparametric demand models

X Chen, S Miao, Y Wang - Operations Research, 2023 - pubsonline.informs.org

In recent decades, the advance of information technology and abundant personal data
facilitate the application of algorithmic personalized pricing. However, this leads to the …

被引用次数：29 相关文章所有 9 个版本

[PDF] neurips.cc

Local differential privacy for regret minimization in reinforcement learning

E Garcelon, V Perchet… - Advances in Neural …, 2021 - proceedings.neurips.cc

Reinforcement learning algorithms are widely used in domains where it is desirable to
provide a personalized service. In these domains it is common that user data contains …

被引用次数：36 相关文章所有 9 个版本

[PDF] neurips.cc

Differentially private multi-armed bandits in the shuffle model

J Tenenbaum, H Kaplan, Y Mansour… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract We give an $(\varepsilon,\delta) $-differentially private algorithm for the Multi-Armed
Bandit (MAB) problem in the shuffle model with a distribution-dependent regret of $ O\left …

被引用次数：29 相关文章所有 10 个版本

[PDF] arxiv.org

Differentially private reinforcement learning with linear function approximation

X Zhou - Proceedings of the ACM on Measurement and Analysis …, 2022 - dl.acm.org

Motivated by the wide adoption of reinforcement learning (RL) in real-world personalized
services, where users' sensitive and private information needs to be protected, we study …

被引用次数：23 相关文章所有 6 个版本

高级搜索

QQ 群