Fake it till you make it: Guidelines for effective synthetic data generation

FK Dankar, M Ibrahim - Applied Sciences, 2021 - mdpi.com
Synthetic data provides a privacy protecting mechanism for the broad usage and sharing of
healthcare data for secondary purposes. It is considered a safe approach for the sharing of …

Synthetic data generation: State of the art in health care domain

H Murtaza, M Ahmed, NF Khan, G Murtaza… - Computer Science …, 2023 - Elsevier
Recent progress in artificial intelligence and machine learning has led to the growth of
research in every aspect of life including the health care domain. However, privacy risks and …

Synthetic data use: exploring use cases to optimise data utility

S James, C Harbron, J Branson, M Sundler - Discover Artificial Intelligence, 2021 - Springer
Synthetic data is a rapidly evolving field with growing interest from multiple industry
stakeholders and European bodies. In particular, the pharmaceutical industry is starting to …

A multifaceted benchmarking of synthetic electronic health record generation models

C Yan, Y Yan, Z Wan, Z Zhang, L Omberg… - Nature …, 2022 - nature.com
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …

Generating and evaluating cross‐sectional synthetic electronic healthcare data: Preserving data utility and patient privacy

Z Wang, P Myles, A Tucker - Computational Intelligence, 2021 - Wiley Online Library
Electronic healthcare record data have been used to study risk factors of disease, treatment
effectiveness and safety, and to inform healthcare service planning. There has been …

Synthetic data as an enabler for machine learning applications in medicine

JF Rajotte, R Bergen, DL Buckeridge, K El Emam, R Ng… - Iscience, 2022 - cell.com
Synthetic data generation is the process of using machine learning methods to train a model
that captures the patterns in a real dataset. Then new or synthetic data can be generated …

Spot the difference: comparing results of analyses from real patient data and synthetic derivatives

RE Foraker, SC Yu, A Gupta, AP Michelson… - JAMIA …, 2020 - academic.oup.com
Background Synthetic data may provide a solution to researchers who wish to generate and
share data in support of precision healthcare. Recent advances in data synthesis enable the …

[HTML][HTML] Membership inference attacks against synthetic health data

Z Zhang, C Yan, BA Malin - Journal of biomedical informatics, 2022 - Elsevier
Synthetic data generation has emerged as a promising method to protect patient privacy
while sharing individual-level health data. Intuitively, sharing synthetic data should reduce …

A survey of synthetic data generation for machine learning

M Abufadda, K Mansour - 2021 22nd international arab …, 2021 - ieeexplore.ieee.org
Data is the fuel of machine learning algorithms, therefore data generation in machine
learning is becoming an important topic. The problem is that finding enough data for …

[图书][B] Practical synthetic data generation: balancing privacy and the broad availability of data

K El Emam, L Mosquera, R Hoptroff - 2020 - books.google.com
Building and testing machine learning models requires access to large and diverse data. But
where can you find usable datasets without running into privacy issues? This practical book …