Balancing inferential integrity and disclosure risk via model targeted masking and multiple imputation

B Jiang, AE Raftery, RJ Steele… - Journal of the American …, 2021 - Taylor & Francis
There is a growing expectation that data collected by government-funded studies should be
openly available to ensure research reproducibility, which also increases concerns about …

Disclosure control using partially synthetic data for large‐scale health surveys, with applications to CanCORS

B Loong, AM Zaslavsky, Y He… - Statistics in …, 2013 - Wiley Online Library
Statistical agencies have begun to partially synthesize public‐use data for major surveys to
protect the confidentiality of respondents' identities and sensitive attributes by replacing high …

Selecting the number of imputed datasets when using multiple imputation for missing data and disclosure limitation

JP Reiter - Statistics & probability letters, 2008 - Elsevier
Selecting the number of imputed datasets when using multiple imputation for missing data and
disclosure limitation - ScienceDirect Skip to main contentSkip to article Elsevier logo Journals …

A novel imputation approach for sharing protected public health data

EA Erdman, LD Young, DL Bernson… - … Journal of Public …, 2021 - ajph.aphapublications.org
Objectives. To develop an imputation method to produce estimates for suppressed values
within a shared government administrative data set to facilitate accurate data sharing and …

[HTML][HTML] Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified

E Curnow, JR Carpenter, JE Heron, RP Cornish… - Journal of clinical …, 2023 - Elsevier
Objectives Epidemiological studies often have missing data, which are commonly handled
by multiple imputation (MI). Standard (default) MI procedures use simple linear covariate …

Multiple imputation for incomplete data in epidemiologic studies

O Harel, EM Mitchell, NJ Perkins… - American journal of …, 2018 - academic.oup.com
Epidemiologic studies are frequently susceptible to missing information. Omitting
observations with missing variables remains a common strategy in epidemiologic studies …

[PDF][PDF] Multiple Imputation and Synthetic Data Generation with NPBayesImputeCat.

J Hu, O Akande, Q Wang - R Journal, 2021 - rjournal.github.io
In many contexts, missing data and disclosure control are ubiquitous and challenging
issues. In particular, at statistical agencies, the respondent-level data they collect from …

Recovery of information from multiple imputation: a simulation study

KJ Lee, JB Carlin - Emerging themes in epidemiology, 2012 - Springer
Background Multiple imputation is becoming increasingly popular for handling missing data.
However, it is often implemented without adequate consideration of whether it offers any …

On the use of multiple imputation to address data missing by design as well as unintended missing data in case-cohort studies with a binary endpoint

M Middleton, C Nguyen, JB Carlin… - BMC Medical Research …, 2023 - Springer
Background Case-cohort studies are conducted within cohort studies, with the defining
feature that collection of exposure data is limited to a subset of the cohort, leading to a large …

Outcome-sensitive multiple imputation: a simulation study

E Kontopantelis, IR White, M Sperrin… - BMC medical research …, 2017 - Springer
Background Multiple imputation is frequently used to deal with missing data in healthcare
research. Although it is known that the outcome should be included in the imputation model …