How to dp-fy ml: A practical guide to machine learning with differential privacy

N Ponomareva, H Hazimeh, A Kurakin, Z Xu… - Journal of Artificial …, 2023 - jair.org
Abstract Machine Learning (ML) models are ubiquitous in real-world applications and are a
constant focus of research. Modern ML models have become more complex, deeper, and …

Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

Anonymization techniques for privacy preserving data publishing: A comprehensive survey

A Majeed, S Lee - IEEE access, 2020 - ieeexplore.ieee.org
Anonymization is a practical solution for preserving user's privacy in data publishing. Data
owners such as hospitals, banks, social network (SN) service providers, and insurance …

Virtual homogeneity learning: Defending against data heterogeneity in federated learning

Z Tang, Y Zhang, S Shi, X He… - … on Machine Learning, 2022 - proceedings.mlr.press
In federated learning (FL), model performance typically suffers from client drift induced by
data heterogeneity, and mainstream works focus on correcting client drift. We propose a …

The algorithmic foundations of differential privacy

C Dwork, A Roth - Foundations and Trends® in Theoretical …, 2014 - nowpublishers.com
The problem of privacy-preserving data analysis has a long history spanning multiple
disciplines. As electronic data about individuals becomes increasingly detailed, and as …

More than privacy: Applying differential privacy in key areas of artificial intelligence

T Zhu, D Ye, W Wang, W Zhou… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Artificial Intelligence (AI) has attracted a great deal of attention in recent years. However,
alongside all its advancements, problems have also emerged, such as privacy violations …

Towards practical differential privacy for SQL queries

N Johnson, JP Near, D Song - Proceedings of the VLDB Endowment, 2018 - dl.acm.org
Differential privacy promises to enable general data analytics while protecting individual
privacy, but existing differential privacy mechanisms do not support the wide variety of …

Privbayes: Private data release via bayesian networks

J Zhang, G Cormode, CM Procopiuc… - ACM Transactions on …, 2017 - dl.acm.org
Privacy-preserving data publishing is an important problem that has been the focus of
extensive study. The state-of-the-art solution for this problem is differential privacy, which …

The complexity of differential privacy

S Vadhan - Tutorials on the Foundations of Cryptography …, 2017 - Springer
Differential privacy is a theoretical framework for ensuring the privacy of individual-level data
when performing statistical analysis of privacy-sensitive datasets. This tutorial provides an …

Winning the NIST Contest: A scalable and general approach to differentially private synthetic data

R McKenna, G Miklau, D Sheldon - arXiv preprint arXiv:2108.04978, 2021 - arxiv.org
We propose a general approach for differentially private synthetic data generation, that
consists of three steps:(1) select a collection of low-dimensional marginals,(2) measure …