Locally differentially private reinforcement learning for linear mixture markov decision processes

K Mo, P Ye, X Ren, S Wang, W Li, J Li - ACM Computing Surveys, 2024 - dl.acm.org

Deep Reinforcement Learning (DRL) is an essential subfield of Artificial Intelligence (AI),
where agents interact with environments to learn policies for solving complex tasks. In recent …

被引用次数：6 相关文章

[PDF] arxiv.org

Differentially private reinforcement learning with linear function approximation

X Zhou - Proceedings of the ACM on Measurement and Analysis …, 2022 - dl.acm.org

Motivated by the wide adoption of reinforcement learning (RL) in real-world personalized
services, where users' sensitive and private information needs to be protected, we study …

被引用次数：32 相关文章所有 6 个版本

[PDF] arxiv.org

Shuffle private linear contextual bandits

SR Chowdhury, X Zhou - arXiv preprint arXiv:2202.05567, 2022 - arxiv.org

Differential privacy (DP) has been recently introduced to linear contextual bandits to formally
address the privacy concerns in its associated personalized services to participating users …

被引用次数：23 相关文章所有 4 个版本

[PDF] arxiv.org

Distributed differential privacy in multi-armed bandits

SR Chowdhury, X Zhou - arXiv preprint arXiv:2206.05772, 2022 - arxiv.org

We consider the standard $ K $-armed bandit problem under a distributed trust model of
differential privacy (DP), which enables to guarantee privacy without a trustworthy server …

被引用次数：16 相关文章所有 4 个版本

[PDF] neurips.cc

Offline reinforcement learning with differential privacy

D Qiao, YX Wang - Advances in Neural Information …, 2024 - proceedings.neurips.cc

The offline reinforcement learning (RL) problem is often motivated by the need to learn data-
driven decision policies in financial, legal and healthcare applications. However, the learned …

被引用次数：20 相关文章所有 7 个版本

[PDF] mlr.press

Near-optimal differentially private reinforcement learning

D Qiao, YX Wang - International Conference on Artificial …, 2023 - proceedings.mlr.press

Motivated by personalized healthcare and other applications involving sensitive data, we
study online exploration in reinforcement learning with differential privacy (DP) constraints …

被引用次数：11 相关文章所有 5 个版本

[PDF] arxiv.org

Preserving Expert-Level Privacy in Offline Reinforcement Learning

N Sharma, V Vinod, A Thakurta, A Agarwal… - arXiv preprint arXiv …, 2024 - arxiv.org

The offline reinforcement learning (RL) problem aims to learn an optimal policy from
historical data collected by one or more behavioural policies (experts) by interacting with an …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Differentially private exploration in reinforcement learning with linear representation

P Luyo, E Garcelon, A Lazaric, M Pirotta - arXiv preprint arXiv:2112.01585, 2021 - arxiv.org

This paper studies privacy-preserving exploration in Markov Decision Processes (MDPs)
with linear representation. We first consider the setting of linear-mixture MDPs (Ayoub et al …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Privacy Preserving Reinforcement Learning for Population Processes

S Yang-Zhao, KS Ng - arXiv preprint arXiv:2406.17649, 2024 - arxiv.org

We consider the problem of privacy protection in Reinforcement Learning (RL) algorithms
that operate over population processes, a practical but understudied setting that includes, for …

被引用次数：1 相关文章所有 3 个版本

[PDF] mlr.press

Differentially private episodic reinforcement learning with heavy-tailed rewards

Y Wu, X Zhou, SR Chowdhury… - … Conference on Machine …, 2023 - proceedings.mlr.press

In this paper we study the problem of (finite horizon tabular) Markov decision processes
(MDPs) with heavy-tailed rewards under the constraint of differential privacy (DP) …

被引用次数：1 相关文章所有 7 个版本

高级搜索

QQ 群