NR Smalheiser, VI Torvik - Annual review of information science …, 2009 - researchgate.net
For any work of literature, a fundamental issue is to identify the individual (s) who wrote it, and conversely, to identify all of the works that belong to a given individual. Attribution would …
This chapter provides an overview of the data matching process, and describes the five major steps involved in this process: data pre-processing (cleaning and standardisation) …
P Christen - IEEE transactions on knowledge and data …, 2011 - ieeexplore.ieee.org
Record linkage is the process of matching records from several databases that refer to the same entities. When applied on a single database, this process is known as deduplication …
The rapid growth of the Web in the past two decades has made it the largest publicly accessible data source in the world. Web mining aims to discover useful information or …
The process of identifying which records in two or more databases correspond to the same entity is an important aspect of data quality activities such as data pre-processing and data …
Sensitive personal data are created in many application domains, and there is now an increasing demand to share, integrate, and link such data within and across organisations in …
P Christen - Sixth IEEE International Conference on Data …, 2006 - ieeexplore.ieee.org
Finding and matching personal names is at the core of an increasing number of applications: from text and Web mining, search engines, to information extraction …
P Christen - Proceedings of the 14th ACM SIGKDD international …, 2008 - dl.acm.org
Matching records that refer to the same entity across data-bases is becoming an increasingly important part of many data mining projects, as often data from multiple sources …