Preserving privacy in large language models: A survey on current threats and solutions

M Miranda, ES Ruzzetti, A Santilli, FM Zanzotto… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) represent a significant advancement in artificial
intelligence, finding applications across various domains. However, their reliance on …

Generative AI for synthetic data across multiple medical modalities: A systematic review of recent developments and challenges

M Ibrahim, YA Khalil, S Amirrajab, C Sun… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a comprehensive systematic review of generative models (GANs, VAEs,
DMs, and LLMs) used to synthesize various medical data types, including imaging …

Synthetic Health Data: Real Ethical Promise and Peril

D Susser, DS Schiff, S Gerke, LY Cabrera… - Hastings Center …, 2024 - Wiley Online Library
Researchers and practitioners are increasingly using machine‐generated synthetic data as
a tool for advancing health science and practice, by expanding access to health data while …

Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

S Kweon, J Kim, J Kim, S Im, E Cho, S Bae, J Oh… - arXiv preprint arXiv …, 2023 - arxiv.org
The development of large language models tailored for handling patients' clinical notes is
often hindered by the limited accessibility and usability of these notes due to strict privacy …

[HTML][HTML] The impact of collaborative documentation on person-centered care: Textual analysis of clinical notes

V Stanhope, N Yoo, E Matthews… - JMIR Medical …, 2024 - medinform.jmir.org
Background Collaborative documentation (CD) is a behavioral health practice involving
shared writing of clinic visit notes by providers and consumers. Despite widespread …

A Review on Generative AI Models for Synthetic Medical Text, Time Series, and Longitudinal Data

M Loni, F Poursalim, M Asadi… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents the results of a novel scoping review on the practical models for
generating three different types of synthetic health records (SHRs): medical text, time series …

Preserving Privacy During Reinforcement Learning With AI Feedback

D Gao, I Miller, A Allami, D Lin - 2024 IEEE 6th International …, 2024 - ieeexplore.ieee.org
Leveraging the scalable efficacy of reinforcement learning from AI feedback (RLAIF), large
language models (LLMs) can be refined toward human intent alignment. While current …

Enhancing Skin Cancer Detection with Multimodal Data Integration: A Combined Approach Using Images and Clinical Notes

V Chakkarapani, S Poornapushpakala, S Suresh - SN Computer Science, 2025 - Springer
Skin cancer is one of the most common cancers worldwide where early detection is crucial
for effectively diagnosing and treatment. Traditional diagnostic methods largely depend on …

A Novel Taxonomy for Navigating and Classifying Synthetic Data in Healthcare Applications

BVAN DIJK, S ul ISLAM, J Achterberg, H MUHAMMAD… - 2024 - ebooks.iospress.nl
Data-driven technologies have improved the efficiency, reliability and effectiveness of
healthcare services, but come with an increasing demand for data, which is challenging due …

[PDF][PDF] Generating English Synthetic Documents with Clinical Keywords: A Privacy-Sensitive Methodology

S Meoni, É de la Clergerie, T Ryffel - LREC-COLING 2024, 2024 - aclanthology.org
Abstract Electronic Health Records (EHR) store valuable patient-staff interaction data. These
notes, often unstructured to save healthcare personnel time, can be challenging to analyze …