Synthetic data generation for tabular health records: A systematic review

M Hernandez, G Epelde, A Alberdi, R Cilla, D Rankin - Neurocomputing, 2022 - Elsevier
Synthetic data generation (SDG) research has been ongoing for some time with promising
results in different application domains, including healthcare, biometrics and energy …

Natural language processing in electronic health records in relation to healthcare decision-making: a systematic review

E Hossain, R Rana, N Higgins, J Soar, PD Barua… - Computers in biology …, 2023 - Elsevier
Abstract Background: Natural Language Processing (NLP) is widely used to extract clinical
insights from Electronic Health Records (EHRs). However, the lack of annotated data …

[图书][B] Synthetic data for deep learning

SI Nikolenko - 2021 - Springer
You are holding in your hands… oh, come on, who holds books like this in their hands
anymore? Anyway, you are reading this, and it means that I have managed to release one of …

Generative adversarial networks and its applications in biomedical informatics

L Lan, L You, Z Zhang, Z Fan, W Zhao, N Zeng… - Frontiers in public …, 2020 - frontiersin.org
The basic Generative Adversarial Networks (GAN) model is composed of the input vector,
generator, and discriminator. Among them, the generator and discriminator are implicit …

Differentially private synthetic medical data generation using convolutional GANs

A Torfi, EA Fox, CK Reddy - Information Sciences, 2022 - Elsevier
Deep learning models have demonstrated superior performance in several real-world
application problems such as image classification and speech processing. However …

Downstream task performance of BERT models pre-trained using automatically de-identified clinical data

T Vakili, A Lamproudis, A Henriksson… - Proceedings of the …, 2022 - aclanthology.org
Automatic de-identification is a cost-effective and straightforward way of removing large
amounts of personally identifiable information from large and sensitive corpora. However …

Explanatory predictive model for COVID-19 severity risk employing machine learning, shapley addition, and LIME

M Laatifi, S Douzi, H Ezzine, CE Asry, A Naya… - Scientific Reports, 2023 - nature.com
The rapid spread of SARS-CoV-2 threatens global public health and impedes the operation
of healthcare systems. Several studies have been conducted to confirm SARS-CoV-2 …

Synthetic tabular data evaluation in the health domain covering resemblance, utility, and privacy dimensions

M Hernadez, G Epelde, A Alberdi… - … of information in …, 2023 - thieme-connect.com
Background Synthetic tabular data generation is a potentially valuable technology with great
promise for data augmentation and privacy preservation. However, prior to adoption, an …

Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition

J Li, Y Zhou, X Jiang, K Natarajan… - Journal of the …, 2021 - academic.oup.com
Objective: Developing clinical natural language processing systems often requires access to
many clinical documents, which are not widely available to the public due to privacy and …

Twin: Personalized clinical trial digital twin generation

T Das, Z Wang, J Sun - Proceedings of the 29th ACM SIGKDD …, 2023 - dl.acm.org
Clinical trial digital twins are virtual patients that reflect personal characteristics in a high
degree of granularity and can be used to simulate various patient outcomes under different …