Generating and evaluating cross‐sectional synthetic electronic healthcare data: Preserving data utility and patient privacy

Z Wang, P Myles, A Tucker - Computational Intelligence, 2021 - Wiley Online Library
Electronic healthcare record data have been used to study risk factors of disease, treatment
effectiveness and safety, and to inform healthcare service planning. There has been …

[HTML][HTML] Generating high-fidelity synthetic patient data for assessing machine learning healthcare software

A Tucker, Z Wang, Y Rotalinti, P Myles - NPJ digital medicine, 2020 - nature.com
There is a growing demand for the uptake of modern artificial intelligence technologies
within healthcare systems. Many of these technologies exploit historical patient health data …

Generating and evaluating synthetic UK primary care data: preserving data utility & patient privacy

Z Wang, P Myles, A Tucker - 2019 IEEE 32nd International …, 2019 - ieeexplore.ieee.org
There is increasing interest in the potential of synthetic data to validate and benchmark
machine learning algorithms as well as reveal any biases in real-world data used for …

Synthetic data generation: State of the art in health care domain

H Murtaza, M Ahmed, NF Khan, G Murtaza… - Computer Science …, 2023 - Elsevier
Recent progress in artificial intelligence and machine learning has led to the growth of
research in every aspect of life including the health care domain. However, privacy risks and …

A multifaceted benchmarking of synthetic electronic health record generation models

C Yan, Y Yan, Z Wan, Z Zhang, L Omberg… - Nature …, 2022 - nature.com
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …

Fake it till you make it: Guidelines for effective synthetic data generation

FK Dankar, M Ibrahim - Applied Sciences, 2021 - mdpi.com
Synthetic data provides a privacy protecting mechanism for the broad usage and sharing of
healthcare data for secondary purposes. It is considered a safe approach for the sharing of …

Synthetic data use: exploring use cases to optimise data utility

S James, C Harbron, J Branson, M Sundler - Discover Artificial Intelligence, 2021 - Springer
Synthetic data is a rapidly evolving field with growing interest from multiple industry
stakeholders and European bodies. In particular, the pharmaceutical industry is starting to …

Synthetic patient data generation and evaluation in disease prediction using small and imbalanced datasets

AJ Rodriguez-Almeida, H Fabelo… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
The increasing prevalence of chronic non-communicable diseases makes it a priority to
develop tools for enhancing their management. On this matter, Artificial Intelligence …

Synthetic data as an enabler for machine learning applications in medicine

JF Rajotte, R Bergen, DL Buckeridge, K El Emam, R Ng… - Iscience, 2022 - cell.com
Synthetic data generation is the process of using machine learning methods to train a model
that captures the patterns in a real dataset. Then new or synthetic data can be generated …

Challenges and opportunities beyond structured data in analysis of electronic health records

M Tayefi, P Ngo, T Chomutare… - Wiley …, 2021 - Wiley Online Library
Electronic health records (EHR) contain a lot of valuable information about individual
patients and the whole population. Besides structured data, unstructured data in EHRs can …