P Christen - IEEE transactions on knowledge and data …, 2011 - ieeexplore.ieee.org
Record linkage is the process of matching records from several databases that refer to the same entities. When applied on a single database, this process is known as deduplication …
This tutorial provides a comprehensive and cohesive overview of the key research results in the area of record linkage methodologies and algorithms for identifying approximate …
L Gu, R Baxter, D Vickers, C Rainsford - CSIRO Mathematical and …, 2003 - Citeseer
Record linkage is the task of quickly and accurately identifying records corresponding to the same entity from one or more data sources. Record linkage is also known as data cleaning …
In an error-free system with perfectly clean data, the construction of a global view of the data consists of linking-in relational terms, joining-two or more tables on their key fields …
Record linkage, the problem of determining when two records refer to the same entity, has applications for both data cleaning (deduplication) and for integrating data from multiple …
Data quality has many dimensions one of which is accuracy. Accuracy is usually compromised by errors accidentally or intensionally introduced in a database system. These …
Record linkage methods for multidatabase data mining Page 1 Record linkage methods for multidatabase data mining Vicenc; Torral and Josep Domingo-Ferrer2 1 Institut d'Investigaci6 …
P Christen, K Goiser - Quality measures in data mining, 2007 - Springer
Deduplicating one data set or linking several data sets are increasingly important tasks in the data preparation steps of many data mining projects. The aim of such linkages is to …
P Christen - Proceedings of the 14th ACM SIGKDD international …, 2008 - dl.acm.org
The task of linking databases is an important step in an increasing number of data mining projects, because linked data can contain information that is not available otherwise, or that …