A scaling approach to record linkage

H Goldstein, K Harron, M Cortina‐Borja - Statistics in medicine, 2017 - Wiley Online Library
With increasing availability of large datasets derived from administrative and other sources,
there is an increasing demand for the successful linking of these to provide rich sources of …

A new computationally efficient algorithm for record linkage with field dependency and missing data imputation

J Ferguson, A Hannigan, A Stack - International journal of medical …, 2018 - Elsevier
Record linkage algorithms aim to identify pairs of records that correspond to the same
individual from two or more datasets. In general, fields that are common to both datasets are …

Empirical aspects of record linkage across multiple data sets using statistical linkage keys: the experience of the PIAC cohort study

R Karmel, P Anderson, D Gibson, A Peut… - BMC Health Services …, 2010 - Springer
Abstract Background In Australia, many community service program data collections
developed over the last decade, including several for aged care programs, contain a …

CIDACS-RL: a novel indexing search and scoring-based record linkage system for huge datasets with high accuracy and scalability

GCG Barbosa, MS Ali, B Araujo, S Reis, S Sena… - BMC medical informatics …, 2020 - Springer
Background Record linkage is the process of identifying and combining records about the
same individual from two or more different datasets. While there are many open source and …

A simple sampling method for estimating the accuracy of large scale record linkage projects

JH Boyd, T Guiver, SM Randall… - … of information in …, 2016 - thieme-connect.com
Background: Record linkage techniques allow different data collections to be brought
together to provide a wider picture of the health status of individuals. Ensuring high linkage …

Technical challenges of providing record linkage services for research

JH Boyd, SM Randall, AM Ferrante, JK Bauer… - BMC medical informatics …, 2014 - Springer
Background Record linkage techniques are widely used to enable health researchers to
gain event based longitudinal information for entire populations. The task of record linkage …

[HTML][HTML] When to conduct probabilistic linkage vs. deterministic linkage? A simulation study

Y Zhu, Y Matsuyama, Y Ohashi, S Setoguchi - Journal of biomedical …, 2015 - Elsevier
Introduction When unique identifiers are unavailable, successful record linkage depends
greatly on data quality and types of variables available. While probabilistic linkage …

A practical approach for incorporating dependence among fields in probabilistic record linkage

JK Daggy, H Xu, SL Hui, RE Gamache… - BMC medical informatics …, 2013 - Springer
Background Methods for linking real-world healthcare data often use a latent class model,
where the latent, or unknown, class is the true match status of candidate record-pairs. This …

[HTML][HTML] Improving record linkage performance in the presence of missing linkage data

TC Ong, MV Mannino, LM Schilling, MG Kahn - Journal of biomedical …, 2014 - Elsevier
Introduction Existing record linkage methods do not handle missing linking field values in an
efficient and effective manner. The objective of this study is to investigate three novel …

[PDF][PDF] Towards automated record linkage

K Goiser, P Christen - AusDM, 2006 - ausdm.org
Abstract The field of Record Linkage is concerned with identifying records from one or more
datasets which refer to the same underlying entities. Where entity-unique identifiers are not …