Authors
Luiza Antonie, Kris Inwood, Daniel J Lizotte, J Andrew Ross
Publication date
2014/4
Journal
Machine Learning
Volume
95
Pages
129-146
Publisher
Springer US
Description
Linking multiple databases to create longitudinal data is an important research problem with multiple applications. Longitudinal data allow analysts to perform studies that would be infeasible otherwise. We have linked historical census databases to create longitudinal data that allow tracking people over time. These longitudinal data have already been used by social scientists and historians to investigate historical trends and to address questions about society, history and economy, and this comparative, systematic research would not be possible without the linked data. The goal of the linking is to identify the same person in multiple census collections. Data imprecision in historical census data and the lack of unique personal identifiers make this task a challenging one. In this paper we design and employ a record linkage system that incorporates a supervised learning module for classifying pairs of …