Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arXiv preprint arXiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

[PDF][PDF] PATE-GAN: Generating synthetic data with differential privacy guarantees

J Jordon, J Yoon, M Van Der Schaar - International conference on …, 2018 - openreview.net
Machine learning has the potential to assist many communities in using the large datasets
that are becoming more and more available. Unfortunately, much of that potential is not …

The complexity of differential privacy

S Vadhan - Tutorials on the Foundations of Cryptography …, 2017 - Springer
Differential privacy is a theoretical framework for ensuring the privacy of individual-level data
when performing statistical analysis of privacy-sensitive datasets. This tutorial provides an …

Boosting and differential privacy

C Dwork, GN Rothblum… - 2010 IEEE 51st annual …, 2010 - ieeexplore.ieee.org
Boosting is a general method for improving the accuracy of learning algorithms. We use
boosting to construct improved privacy-pre serving synopses of an input database. These …

{PrivSyn}: Differentially private data synthesis

Z Zhang, T Wang, N Li, J Honorio, M Backes… - 30th USENIX Security …, 2021 - usenix.org
In differential privacy (DP), a challenging problem is to generate synthetic datasets that
efficiently capture the useful information in the private data. The synthetic dataset enables …

A simple and practical algorithm for differentially private data release

M Hardt, K Ligett, F McSherry - Advances in neural …, 2012 - proceedings.neurips.cc
We present a new algorithm for differentially private data release, based on a simple
combination of the Exponential Mechanism with the Multiplicative Weights update rule. Our …

A learning theory approach to noninteractive database privacy

A Blum, K Ligett, A Roth - Journal of the ACM (JACM), 2013 - dl.acm.org
In this article, we demonstrate that, ignoring computational constraints, it is possible to
release synthetic databases that are useful for accurately answering large classes of queries …

A multiplicative weights mechanism for privacy-preserving data analysis

M Hardt, GN Rothblum - 2010 IEEE 51st annual symposium on …, 2010 - ieeexplore.ieee.org
We consider statistical data analysis in the interactive setting. In this setting a trusted curator
maintains a database of sensitive information about individual participants, and releases …

Differentially private query release through adaptive projection

S Aydore, W Brown, M Kearns… - International …, 2021 - proceedings.mlr.press
We propose, implement, and evaluate a new algo-rithm for releasing answers to very large
numbersof statistical queries likek-way marginals, sub-ject to differential privacy. Our …

[图书][B] Privacy, big data, and the public good: Frameworks for engagement

J Lane, V Stodden, S Bender, H Nissenbaum - 2014 - books.google.com
Massive amounts of data on human beings can now be analyzed. Pragmatic purposes
abound, including selling goods and services, winning political campaigns, and identifying …