Evaluation of record linkage methods for iterative insertions

M Sariyar, A Borg… - Methods of information in …, 2009 - thieme-connect.com
Objectives: There have been many developments and applications of mathematical
methods in the context of record linkage as one area of interdisciplinary research efforts …

An empiric weight computation for record linkage using linearly combined fields' similarity scores

X Li, A Guttmann, J Demongeot… - 2014 36th Annual …, 2014 - ieeexplore.ieee.org
Record linkage is the task of identifying which records from one or more data sources refer
to the same entity. Many record linkage methods were introduced and applied over the last …

Implementation of an extended Fellegi-Sunter probabilistic record linkage method using the Jaro-Winkler string comparator

X Li, A Guttmann, S Cipière, L Maigne… - … on Biomedical and …, 2014 - ieeexplore.ieee.org
Record linkage is the task of identifying which records from one or more data sources refer
to the same person. Often, records do not have a common key and may contain …

A new computationally efficient algorithm for record linkage with field dependency and missing data imputation

J Ferguson, A Hannigan, A Stack - International journal of medical …, 2018 - Elsevier
Record linkage algorithms aim to identify pairs of records that correspond to the same
individual from two or more datasets. In general, fields that are common to both datasets are …

An improved Fellegi-Sunter framework for probabilistic record linkage between large data sets

M Fortini - Journal of Official Statistics, 2020 - journals.sagepub.com
Record linkage addresses the problem of identifying pairs of records coming from different
sources and referred to the same unit of interest. Fellegi and Sunter propose an optimal …

A scaling approach to record linkage

H Goldstein, K Harron, M Cortina‐Borja - Statistics in medicine, 2017 - Wiley Online Library
With increasing availability of large datasets derived from administrative and other sources,
there is an increasing demand for the successful linking of these to provide rich sources of …

Improving Probabilistic Record Linkage Using Statistical Prediction Models

A Moretti, N Shlomo - International Statistical Review, 2023 - Wiley Online Library
Record linkage brings together information from records in two or more data sources that are
believed to belong to the same statistical unit based on a common set of matching variables …

Variable selection for latent class analysis in the presence of missing data with application to record linkage

H Xu, X Li, Z Zhang, S Grannis - Statistical Methods in …, 2024 - journals.sagepub.com
The Fellegi-Sunter model is a latent class model widely used in …

Bagging, bumping, multiview, and active learning for record linkage with empirical results on patient identity data

M Sariyar, A Borg - Computer methods and programs in biomedicine, 2012 - Elsevier
Record linkage or deduplication deals with the detection and deletion of duplicates in and
across files. For this task, this paper introduces and evaluates two new machine-learning …

Improving record linkage performance in the presence of missing linkage data

TC Ong, MV Mannino, LM Schilling, MG Kahn - Journal of biomedical …, 2014 - Elsevier
Introduction: Existing record linkage methods do not handle missing linking field values in an
efficient and effective manner. The objective of this study is to investigate three novel …