An evaluation of the replicability of analyses using synthetic health data

K El Emam, L Mosquera, X Fang, A El-Hussuna - Scientific Reports, 2024 - nature.com
Synthetic data generation is being increasingly used as a privacy preserving approach for
sharing health data. In addition to protecting privacy, it is important to ensure that generated …

Assessing privacy and quality of synthetic health data

A Yale, S Dash, R Dutta, I Guyon, A Pavao… - Proceedings of the …, 2019 - dl.acm.org
This paper builds on the results of the ESANN 2019 conference paper "Privacy Preserving
Synthetic Health Data" [16], which develops metrics for assessing privacy and utility of …

Fake it till you make it: Guidelines for effective synthetic data generation

FK Dankar, M Ibrahim - Applied Sciences, 2021 - mdpi.com
Synthetic data provides a privacy protecting mechanism for the broad usage and sharing of
healthcare data for secondary purposes. It is considered a safe approach for the sharing of …

A primer on synthetic health data

JA Bartell, SB Valentin, A Krogh, H Langberg… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advances in deep generative models have greatly expanded the potential to create
realistic synthetic health datasets. These synthetic datasets aim to preserve the …

A multifaceted benchmarking of synthetic electronic health record generation models

C Yan, Y Yan, Z Wan, Z Zhang, L Omberg… - Nature …, 2022 - nature.com
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …

A scoping review of privacy and utility metrics in medical synthetic data

B Kaabachi, J Despraz, T Meurers, K Otte… - npj Digital …, 2025 - nature.com
The use of synthetic data is a promising solution to facilitate the sharing and reuse of
health-related data beyond its initial collection while addressing privacy concerns. However, there …

Evaluating identity disclosure risk in fully synthetic health data: model development and validation

K El Emam, L Mosquera, J Bass - Journal of medical Internet research, 2020 - jmir.org
Background There has been growing interest in data synthesis for enabling the sharing of
data for secondary analysis; however, there is a need for a comprehensive privacy risk …

Generation and evaluation of privacy preserving synthetic health data

A Yale, S Dash, R Dutta, I Guyon, A Pavao, KP Bennett - Neurocomputing, 2020 - Elsevier
We develop metrics for measuring the quality of synthetic health data for both education and
research. We use novel and existing metrics to capture a synthetic dataset's resemblance …

Synthetic data generation: State of the art in health care domain

H Murtaza, M Ahmed, NF Khan, G Murtaza… - Computer Science …, 2023 - Elsevier
Recent progress in artificial intelligence and machine learning has led to the growth of
research in every aspect of life including the health care domain. However, privacy risks and …

Health synthetic data to enable health learning system and innovation: a scoping review

SF Tsao, K Sharma, H Noor, A Forster… - Caring is Sharing …, 2023 - ebooks.iospress.nl
With the recent advancement in the field of machine learning, health synthetic data has
become a promising technique to address difficulties with time consumption when accessing …