Privacy by design in big data: an overview of privacy enhancing technologies in the era of big data analytics

G D'Acquisto, J Domingo-Ferrer, P Kikiras… - arXiv preprint arXiv …, 2015 - arxiv.org
The extensive collection and processing of personal information in big data analytics has
given rise to serious privacy concerns, related to wide scale electronic surveillance, profiling …

Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arXiv preprint arXiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

A review of Generative Adversarial Networks for Electronic Health Records: applications, evaluation measures and data sources

G Ghosheh, J Li, T Zhu - arXiv preprint arXiv:2203.07018, 2022 - arxiv.org
Electronic Health Records (EHRs) are a valuable asset to facilitate clinical research and
point of care applications; however, many challenges such as data privacy concerns impede …

Generation and evaluation of synthetic patient data

A Goncalves, P Ray, B Soper, J Stevens… - BMC medical research …, 2020 - Springer
Background Machine learning (ML) has made a significant impact in medicine and cancer
research; however, its impact in these areas has been undeniably slower and more limited …

A multifaceted benchmarking of synthetic electronic health record generation models

C Yan, Y Yan, Z Wan, Z Zhang, L Omberg… - Nature …, 2022 - nature.com
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …

A multi-dimensional evaluation of synthetic data generators

FK Dankar, MK Ibrahim, L Ismail - IEEE Access, 2022 - ieeexplore.ieee.org
Synthetic datasets are gradually emerging as solutions for data sharing. Multiple synthetic
data generators have been introduced in the last decade fueled by advancement in machine …

Fake it till you make it: Guidelines for effective synthetic data generation

FK Dankar, M Ibrahim - Applied Sciences, 2021 - mdpi.com
Synthetic data provides a privacy protecting mechanism for the broad usage and sharing of
healthcare data for secondary purposes. It is considered a safe approach for the sharing of …

General and specific utility measures for synthetic data

J Snoke, GM Raab, B Nowok, C Dibben… - Journal of the Royal …, 2018 - academic.oup.com
Data holders can produce synthetic versions of data sets when concerns about potential
disclosure restrict the availability of the original records. The paper is concerned with …

Linking sensitive data

P Christen, T Ranbaduge, R Schnell - Methods and techniques for …, 2020 - Springer
Sensitive personal data are created in many application domains, and there is now an
increasing demand to share, integrate, and link such data within and across organisations in …

[图书][B] Synthetic datasets for statistical disclosure control: theory and implementation

J Drechsler - 2011 - books.google.com
The aim of this book is to give the reader a detailed introduction to the different approaches
to generating multiply imputed synthetic datasets. It describes all approaches that have been …