How well do automated linking methods perform? Lessons from US historical data

MJ Bailey, C Cole, M Henderson… - Journal of economic …, 2020 - aeaweb.org
This paper reviews the literature in historical record linkage in the United States and
examines the performance of widely used record-linking algorithms and common variations …

Historical census record linkage

S Ruggles, CA Fitch, E Roberts - Annual review of sociology, 2018 - annualreviews.org
For the past 80 years, social scientists have been linking historical censuses across time to
study economic and geographic mobility. In recent decades, the quantity of historical census …

A new strategy for linking US historical censuses: A case study for the IPUMS multigenerational longitudinal panel

J Helgertz, J Price, J Wellington… - Historical Methods: A …, 2022 - Taylor & Francis
This paper presents a probabilistic method of record linkage, developed using the US full
count censuses of 1900 and 1910 but applicable to many sources of digitized historical …

[HTML][HTML] Machine learning for science and society

C Rudin, KL Wagstaff - Machine Learning, 2014 - Springer
The special issue on “Machine Learning for Science and Society” showcases machine
learning work with influence on our current and future society. These papers address …

Testing methods of record linkage on Swedish censuses

MJ Wisselgren, S Edvinsson, M Berggren… - Historical Methods: A …, 2014 - Taylor & Francis
Research benefits a great deal when different kinds of registers can be combined. Record
linkage is an important tool for connecting sources to create longitudinal databases of …

Playing with matches: An assessment of accuracy in linked historical data

CG Massey - Historical Methods: A Journal of Quantitative and …, 2017 - Taylor & Francis
This article evaluates linkage quality achieved by various record linkage techniques used in
historical demography. The author creates benchmark, or truth, data by linking the 2005 …

Connecting family trees to construct a population-scale and longitudinal geo-social network for the US

C Koylu, D Guo, Y Huang, A Kasakoff… - International Journal of …, 2021 - Taylor & Francis
ABSTRACT We collected 92,832 user-contributed and publicly available family trees from
rootsweb. com, including 250 million individuals who were born in North America and …

Record linkage in the Cape of Good Hope panel

A Rijpma, J Cilliers, J Fourie - Historical Methods: A Journal of …, 2020 - Taylor & Francis
In this article, we describe the record linkage procedure to create a panel from Cape Colony
census returns, or opgaafrolle, for 1787–1828, a dataset of 42,354 household-level …

Improved similarity assessment and spectral clustering for unsupervised linking of data extracted from bridge inspection reports

K Liu, N El-Gohary - Advanced Engineering Informatics, 2022 - Elsevier
Textual bridge inspection reports are important data sources for supporting data-driven
bridge deterioration prediction and maintenance decision making. Information extraction …

Name2vec: Personal names embeddings

J Foxcroft, A d'Alessandro, L Antonie - … , Kingston, ON, Canada, May 28–31 …, 2019 - Springer
Predicting if two names refer to the same entity is an important task for many domains, such
as information retrieval, record linkage and data integration. In this paper, we propose to …