How to dp-fy ml: A practical guide to machine learning with differential privacy

N Ponomareva, H Hazimeh, A Kurakin, Z Xu… - Journal of Artificial …, 2023 - jair.org
Abstract Machine Learning (ML) models are ubiquitous in real-world applications and are a
constant focus of research. Modern ML models have become more complex, deeper, and …

Grounding and evaluation for large language models: Practical challenges and lessons learned (survey)

K Kenthapadi, M Sameki, A Taly - Proceedings of the 30th ACM SIGKDD …, 2024 - dl.acm.org
With the ongoing rapid adoption of Artificial Intelligence (AI)-based systems in high-stakes
domains, ensuring the trustworthiness, safety, and observability of these systems has …

Gaussian differential privacy

J Dong, A Roth, WJ Su - Journal of the Royal Statistical Society …, 2022 - Wiley Online Library
In the past decade, differential privacy has seen remarkable success as a rigorous and
practical formalization of data privacy. This privacy definition and its divergence based …

Benchmarking differentially private synthetic data generation algorithms

Y Tao, R McKenna, M Hay, A Machanavajjhala… - arXiv preprint arXiv …, 2021 - arxiv.org
This work presents a systematic benchmark of differentially private synthetic data generation
algorithms that can generate tabular data. Utility of the synthetic data is evaluated by …

Confidence-ranked reconstruction of census microdata from published statistics

T Dick, C Dwork, M Kearns, T Liu… - Proceedings of the …, 2023 - National Acad Sciences
A reconstruction attack on a private dataset D takes as input some publicly accessible
information about the dataset and produces a list of candidate elements of D. We introduce a …

Aim: An adaptive and iterative mechanism for differentially private synthetic data

R McKenna, B Mullins, D Sheldon, G Miklau - arXiv preprint arXiv …, 2022 - arxiv.org
We propose AIM, a novel algorithm for differentially private synthetic data generation.\aim is
a workload-adaptive algorithm, within the paradigm of algorithms that first selects a set of …

Iterative methods for private synthetic data: Unifying framework and new methods

T Liu, G Vietri, SZ Wu - Advances in Neural Information …, 2021 - proceedings.neurips.cc
We study private synthetic data generation for query release, where the goal is to construct a
sanitized version of a sensitive dataset, subject to differential privacy, that approximately …

Private synthetic data for multitask learning and marginal queries

G Vietri, C Archambeau, S Aydore… - Advances in …, 2022 - proceedings.neurips.cc
We provide a differentially private algorithm for producing synthetic data simultaneously
useful for multiple tasks: marginal queries and multitask machine learning (ML). A key …

Sok: Privacy-preserving data synthesis

Y Hu, F Wu, Q Li, Y Long, GM Garrido… - … IEEE Symposium on …, 2024 - ieeexplore.ieee.org
As the prevalence of data analysis grows, safeguarding data privacy has become a
paramount concern. Consequently, there has been an upsurge in the development of …

Generating private synthetic data with genetic algorithms

T Liu, J Tang, G Vietri, S Wu - International Conference on …, 2023 - proceedings.mlr.press
We study the problem of efficiently generating differentially private synthetic data that
approximate the statistical properties of an underlying sensitive dataset. In recent years …