Privacy-preserving record linkage for big data: Current approaches and research challenges

D Vatsalan, Z Sehili, P Christen, E Rahm - Handbook of big data …, 2017 - Springer
Abstract The growth of Big Data, especially personal data dispersed in multiple data
sources, presents enormous opportunities and insights for businesses to explore and …

[PDF][PDF] Author name disambiguation

NR Smalheiser, VI Torvik - Annual review of information science …, 2009 - researchgate.net
For any work of literature, a fundamental issue is to identify the individual (s) who wrote it,
and conversely, to identify all of the works that belong to a given individual. Attribution would …

[图书][B] The data matching process

P Christen, P Christen - 2012 - Springer
This chapter provides an overview of the data matching process, and describes the five
major steps involved in this process: data pre-processing (cleaning and standardisation) …

A survey of indexing techniques for scalable record linkage and deduplication

P Christen - IEEE transactions on knowledge and data …, 2011 - ieeexplore.ieee.org
Record linkage is the process of matching records from several databases that refer to the
same entities. When applied on a single database, this process is known as deduplication …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

A taxonomy of privacy-preserving record linkage techniques

D Vatsalan, P Christen, VS Verykios - Information Systems, 2013 - Elsevier
The process of identifying which records in two or more databases correspond to the same
entity is an important aspect of data quality activities such as data pre-processing and data …

Linking sensitive data

P Christen, T Ranbaduge, R Schnell - Methods and techniques for …, 2020 - Springer
Sensitive personal data are created in many application domains, and there is now an
increasing demand to share, integrate, and link such data within and across organisations in …

Adaptive name matching in information integration

M Bilenko, R Mooney, W Cohen… - IEEE Intelligent …, 2003 - ieeexplore.ieee.org
Adaptive name matching in information integration Page 1 16 1094-7167/03/$17.00 © 2003
IEEE IEEE INTELLIGENT SYSTEMS Published by the IEEE Computer Society I nformation I …

A comparison of personal name matching: Techniques and practical issues

P Christen - Sixth IEEE International Conference on Data …, 2006 - ieeexplore.ieee.org
Finding and matching personal names is at the core of an increasing number of
applications: from text and Web mining, search engines, to information extraction …

Febrl- an open source data cleaning, deduplication and record linkage system with a graphical user interface

P Christen - Proceedings of the 14th ACM SIGKDD international …, 2008 - dl.acm.org
Matching records that refer to the same entity across data-bases is becoming an
increasingly important part of many data mining projects, as often data from multiple sources …