A novel imputation approach for sharing protected public health data

EA Erdman, LD Young, DL Bernson… - … Journal of Public …, 2021 - ajph.aphapublications.org
Objectives. To develop an imputation method to produce estimates for suppressed values
within a shared government administrative data set to facilitate accurate data sharing and …

An evaluation of the replicability of analyses using synthetic health data

K El Emam, L Mosquera, X Fang, A El-Hussuna - Scientific Reports, 2024 - nature.com
Synthetic data generation is being increasingly used as a privacy preserving approach for
sharing health data. In addition to protecting privacy, it is important to ensure that generated …

DataSifter II: Partially synthetic data sharing of sensitive information containing time-varying correlated observations

N Zhou, L Wang, S Marino, Y Zhao… - Journal of algorithms & …, 2022 - journals.sagepub.com
There is a significant public demand for rapid data-driven scientific investigations using
aggregated sensitive information. However, many technical challenges and regulatory …

Multiple imputation for missing income data in population-based health surveillance

Z Zeng - Journal of Public Health Management and Practice, 2009 - journals.lww.com
Background Although advanced multiple imputation (MI) methodology has become widely
introduced and increasingly used, few have reported for health surveillance, where missing …

Multiple imputation for incomplete data in epidemiologic studies

O Harel, EM Mitchell, NJ Perkins… - American journal of …, 2018 - academic.oup.com
Epidemiologic studies are frequently susceptible to missing information. Omitting
observations with missing variables remains a common strategy in epidemiologic studies …

Disclosure control using partially synthetic data for large‐scale health surveys, with applications to CanCORS

B Loong, AM Zaslavsky, Y He… - Statistics in …, 2013 - Wiley Online Library
Statistical agencies have begun to partially synthesize public‐use data for major surveys to
protect the confidentiality of respondents' identities and sensitive attributes by replacing high …

Balancing inferential integrity and disclosure risk via model targeted masking and multiple imputation

B Jiang, AE Raftery, RJ Steele… - Journal of the American …, 2021 - Taylor & Francis
There is a growing expectation that data collected by government-funded studies should be
openly available to ensure research reproducibility, which also increases concerns about …

Multiple imputation for disclosure limitation: Future research challenges

JP Reiter - Journal of Privacy and Confidentiality, 2010 - journalprivacyconfidentiality.org
Statistical agencies that disseminate data to the public are ethically and often legally
required to protect the confidentiality of respondents' identities and sensitive attributes. To …

Data, privacy, and the greater good

E Horvitz, D Mulligan - Science, 2015 - science.org
Large-scale aggregate analyses of anonymized data can yield valuable results and insights
that address public health challenges and provide new avenues for scientific discovery …

Standard multiple imputation of survey data didn't perform better than simple substitution in enhancing an administrative dataset: the example of self-rated health in …

F Popham, E Whitley, O Molaodi, L Gray - Emerging themes in …, 2021 - Springer
Background Health surveys provide a rich array of information but on relatively small
numbers of individuals and evidence suggests that they are becoming less representative …