Large scale private learning via low-rank reparametrization

D Yu, H Zhang, W Chen, J Yin… - … Conference on Machine …, 2021 - proceedings.mlr.press
We propose a reparametrization scheme to address the challenges of applying differentially
private SGD on large neural networks, which are 1) the huge memory cost of storing …

Privacy-preserving dynamic personalized pricing with demand learning

X Chen, D Simchi-Levi, Y Wang - Management Science, 2022 - pubsonline.informs.org
The prevalence of e-commerce has made customers' detailed personal information readily
accessible to retailers, and this information has been widely used in pricing decisions. When …

Optimal order simple regret for Gaussian process bandits

S Vakili, N Bouziani, S Jalali… - Advances in Neural …, 2021 - proceedings.neurips.cc
Consider the sequential optimization of a continuous, possibly non-convex, and expensive
to evaluate objective function $ f $. The problem can be cast as a Gaussian Process (GP) …

Federated linear contextual bandits with user-level differential privacy

R Huang, H Zhang, L Melis, M Shen… - International …, 2023 - proceedings.mlr.press
This paper studies federated linear contextual bandits under the notion of user-level
differential privacy (DP). We first introduce a unified federated bandits framework that can …

Zeroth-order optimization meets human feedback: Provable learning via ranking oracles

Z Tang, D Rybin, TH Chang - arXiv preprint arXiv:2303.03751, 2023 - arxiv.org
In this study, we delve into an emerging optimization challenge involving a black-box
objective function that can only be gauged via a ranking oracle-a situation frequently …

When privacy meets partial information: A refined analysis of differentially private bandits

A Azize, D Basu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We study the problem of multi-armed bandits with ε-global Differential Privacy (DP). First, we
prove the minimax and problem-dependent regret lower bounds for stochastic and linear …

Differential privacy in personalized pricing with nonparametric demand models

X Chen, S Miao, Y Wang - Operations Research, 2023 - pubsonline.informs.org
In recent decades, the advance of information technology and abundant personal data
facilitate the application of algorithmic personalized pricing. However, this leads to the …

Local differential privacy for regret minimization in reinforcement learning

E Garcelon, V Perchet… - Advances in Neural …, 2021 - proceedings.neurips.cc
Reinforcement learning algorithms are widely used in domains where it is desirable to
provide a personalized service. In these domains it is common that user data contains …

Differentially private multi-armed bandits in the shuffle model

J Tenenbaum, H Kaplan, Y Mansour… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract We give an $(\varepsilon,\delta) $-differentially private algorithm for the Multi-Armed
Bandit (MAB) problem in the shuffle model with a distribution-dependent regret of $ O\left …

Differentially private reinforcement learning with linear function approximation

X Zhou - Proceedings of the ACM on Measurement and Analysis …, 2022 - dl.acm.org
Motivated by the wide adoption of reinforcement learning (RL) in real-world personalized
services, where users' sensitive and private information needs to be protected, we study …