Blocking and filtering techniques for entity resolution: A survey

G Papadakis, D Skoutas, E Thanos… - ACM Computing Surveys …, 2020 - dl.acm.org
Entity Resolution (ER), a core task of Data Integration, detects different entity profiles that
correspond to the same real-world object. Due to its inherently quadratic complexity, a series …

A survey of blocking and filtering techniques for entity resolution

G Papadakis, D Skoutas, E Thanos… - arXiv preprint arXiv …, 2019 - arxiv.org
Efficiency techniques are an integral part of Entity Resolution, since its infancy. In this
survey, we organized the bulk of works in the field into Blocking, Filtering and hybrid …

A machine learning-based method for content verification in the E-commerce domain

T Alexakis, N Peppes, K Demestichas, E Adamopoulou - Information, 2022 - mdpi.com
Analysis of extreme-scale data is an emerging research topic; the explosion in available
data raises the need for suitable content verification methods and tools to decrease the …

Siamese graph neural networks for data integration

E Krivosheev, M Atzeni, K Mirylenka, P Scotton… - arXiv preprint arXiv …, 2020 - arxiv.org
Data integration has been studied extensively for decades and approached from different
angles. However, this domain still remains largely rule-driven and lacks universal …

Improved similarity assessment and spectral clustering for unsupervised linking of data extracted from bridge inspection reports

K Liu, N El-Gohary - Advanced Engineering Informatics, 2022 - Elsevier
Textual bridge inspection reports are important data sources for supporting data-driven
bridge deterioration prediction and maintenance decision making. Information extraction …

Managing Personal Identifiable Information in Data Lakes

D Oreščanin, T Hlupić, B Vrdoljak - IEEE access, 2024 - ieeexplore.ieee.org
Privacy is a fundamental human right according to the Universal Declaration of Human
Rights of the United Nations. Adoption of the General Data Protection Regulation (GDPR) in …

Exploring Federated Learning for Data Integration: A Structured Literature Review

JP Awick, G Schumann… - … Conference on Big Data …, 2023 - ieeexplore.ieee.org
Data integration is utilized to integrate heterogeneous data from multiple sources,
representing a crucial step to improve information value in data analysis and mining …

Business entity matching with siamese graph convolutional networks

E Krivosheev, M Atzeni, K Mirylenka… - Proceedings of the …, 2021 - ojs.aaai.org
Data integration has been studied extensively for decades and approached from different
angles. However, this domain still remains largely rule-driven and lacks universal …

CompanyName2Vec: Company entity matching based on job ads

R Ziv, I Gronau, M Fire - 2022 IEEE 9th International …, 2022 - ieeexplore.ieee.org
Entity Matching is an essential part of all real-world systems that take in structured and
unstructured data coming from different sources. Typically no common key is available for …

Developing a legal form classification and extraction approach for company entity matching: Benchmark of rule-based and machine learning approaches

F Kruse, JP Awick, JM Gómez, P Loos - Business Information Systems, 2021 - tib-op.org
This paper explores the data integration process step record linkage. Thereby we focus on
the entity company. For the integration of company data, the company name is a crucial …