An overview of end-to-end entity resolution for big data

V Christophides, V Efthymiou, T Palpanas… - ACM Computing …, 2020 - dl.acm.org
One of the most critical tasks for improving data quality and increasing the reliability of data
analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to …

Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020 - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

[PDF][PDF] 大数据的-个重要方面数据可用性

李建中, 刘显敏 - 计算机研究与发展, 2013 - cs.sjtu.edu.cn
摘要!"# $% &'()*+,-.# $/0 123 4567893:;% &'<=>?@ ABCDEF GFHI# $8 J'KLMN
OPQRSTU@'VWIABXYZ [\],@ AB'KLVW^ _I!" AB'aZbc deABQ!^ fS ABXYZghiKjk l# $8 J …

[图书][B] Knowledge graphs

D Fensel, U Simsek, K Angele, E Huaman, E Kärle… - 2020 - Springer
Smart speakers such as Alexa and Google Home introduced Artificial Intelligence (AI) in
millions soon billions of households, making AI an everyday experience. We can now look …

Data and information quality

C Batini, M Scannapieco - Cham, Switzerland: Springer International …, 2016 - Springer
This book is the result of a study path that started in 2006, when the two authors of this book
published the book Data Quality: Concepts, Methodologies and Techniques. After 8 years …

[图书][B] The data matching process

P Christen, P Christen - 2012 - Springer
This chapter provides an overview of the data matching process, and describes the five
major steps involved in this process: data pre-processing (cleaning and standardisation) …

User identity linkage across online social networks: A review

K Shu, S Wang, J Tang, R Zafarani, H Liu - Acm Sigkdd Explorations …, 2017 - dl.acm.org
The increasing popularity and diversity of social media sites has encouraged more and
more people to participate on multiple online social networks to enjoy their services. Each …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

NADEEF: a commodity data cleaning system

M Dallachiesa, A Ebaid, A Eldawy… - Proceedings of the …, 2013 - dl.acm.org
Despite the increasing importance of data quality and the rich theoretical and practical
contributions in all aspects of data cleaning, there is no single end-to-end off-the-shelf …

Entity resolution: theory, practice & open challenges

L Getoor, A Machanavajjhala - Proceedings of the VLDB Endowment, 2012 - dl.acm.org
This tutorial brings together perspectives on ER from a variety of fields, including databases,
machine learning, natural language processing and information retrieval, to provide, in one …