[HTML][HTML] Reliability of supervised machine learning using synthetic data in health care: Model to preserve privacy for data sharing

D Rankin, M Black, R Bond, J Wallace… - JMIR medical …, 2020 - medinform.jmir.org
Background: The exploitation of synthetic data in health care is at an early stage. Synthetic
data could unlock the potential within health care datasets that are too sensitive for release …

[HTML][HTML] Analyzing medical research results based on synthetic data and their relation to real data results: systematic comparison from five observational studies

AR Benaim, R Almog, Y Gorelik… - JMIR medical …, 2020 - medinform.jmir.org
Background: Privacy restrictions limit access to protected patient-derived health information
for research purposes. Consequently, data anonymization is required to allow researchers …

[HTML][HTML] Utility metrics for evaluating synthetic health data generation methods: validation study

K El Emam, L Mosquera, X Fang… - JMIR medical …, 2022 - medinform.jmir.org
Background A regular task by developers and users of synthetic data generation (SDG)
methods is to evaluate and compare the utility of these methods. Multiple utility metrics have …

[HTML][HTML] Learning from others without sacrificing privacy: Simulation comparing centralized and federated machine learning on mobile health data

JC Liu, J Goetz, S Sen, A Tewari - JMIR mHealth and uHealth, 2021 - mhealth.jmir.org
Background The use of wearables facilitates data collection at a previously unobtainable
scale, enabling the construction of complex predictive models with the potential to improve …

[HTML][HTML] Data anonymization for pervasive health care: systematic literature mapping study

Z Zuo, M Watson, D Budgen, R Hall… - JMIR medical …, 2021 - medinform.jmir.org
Background Data science offers an unparalleled opportunity to identify new insights into
many aspects of human life with recent advances in health care. Using data science in …

Synthetic data generation: State of the art in health care domain

H Murtaza, M Ahmed, NF Khan, G Murtaza… - Computer Science …, 2023 - Elsevier
Recent progress in artificial intelligence and machine learning has led to the growth of
research in every aspect of life including the health care domain. However, privacy risks and …

[PDF][PDF] Synthetic data as an enabler for machine learning applications in medicine

JF Rajotte, R Bergen, DL Buckeridge, K El Emam, R Ng… - Iscience, 2022 - cell.com
Synthetic data generation is the process of using machine learning methods to train a model
that captures the patterns in a real dataset. Then new or synthetic data can be generated …

[HTML][HTML] Fake it till you make it: Guidelines for effective synthetic data generation

FK Dankar, M Ibrahim - Applied Sciences, 2021 - mdpi.com
Synthetic data provides a privacy protecting mechanism for the broad usage and sharing of
healthcare data for secondary purposes. It is considered a safe approach for the sharing of …

[HTML][HTML] Characterizing and managing missing structured data in electronic health records: data analysis

BK Beaulieu-Jones, DR Lavage… - JMIR medical …, 2018 - medinform.jmir.org
Background: Missing data is a challenge for all studies; however, this is especially true for
electronic health record (EHR)-based analyses. Failure to appropriately consider missing …

Generating and evaluating cross‐sectional synthetic electronic healthcare data: Preserving data utility and patient privacy

Z Wang, P Myles, A Tucker - Computational Intelligence, 2021 - Wiley Online Library
Electronic healthcare record data have been used to study risk factors of disease, treatment
effectiveness and safety, and to inform healthcare service planning. There has been …