(Almost) all of entity resolution

O Binette, RC Steorts - Science Advances, 2022 - science.org
Whether the goal is to estimate the number of people that live in a congressional district, to
estimate the number of individuals that have died in an armed conflict, or to disambiguate …

Sparkly: A simple yet surprisingly strong TF/IDF blocker for entity matching

D Paulsen, Y Govind, AH Doan - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
Blocking is a major task in entity matching. Numerous blocking solutions have been
developed, but as far as we can tell, blocking using the well-known tf/idf measure has …

A qualitative literature review on microservices identification approaches

C Schröer, F Kruse, J Marx Gómez - … , Greece, September 13-19, 2020 14, 2020 - Springer
Microservices has become a widely used and discussed architectural style for designing
modern applications due to advantages like granular scalability and maintainability …

Medical entity disambiguation using graph neural networks

A Vretinaris, C Lei, V Efthymiou, X Qin… - Proceedings of the 2021 …, 2021 - dl.acm.org
Medical knowledge bases (KBs), distilled from biomedical literature and regulatory actions,
are expected to provide high-quality information to facilitate clinical decision making. Entity …

Magellan: toward building ecosystems of entity matching solutions

AH Doan, P Konda, P Suganthan GC… - Communications of the …, 2020 - dl.acm.org
Entity matching (EM) finds data instances that refer to the same real-world entity. In 2015, we
started the Magellan project at UW-Madison, jointly with industrial partners, to build EM …

RDFFrames: knowledge graph access for machine learning tools

A Mohamed, G Abuoda, A Ghanem, Z Kaoudi… - The VLDB Journal, 2022 - Springer
Abstract Knowledge graphs represented as RDF datasets are integral to many machine
learning applications. RDF is supported by a rich ecosystem of data management systems …

Semantic enrichment of data for AI applications

F Özcan, C Lei, A Quamar, V Efthymiou - … of the Fifth Workshop on Data …, 2021 - dl.acm.org
In this work, we use semantic knowledge sources, such as cross-domain knowledge graphs
(KGs) and domain-specific ontologies, to enrich structured data for various AI applications …

Fairness-Aware Data Preparation for Entity Matching

N Shahbazi, J Wang, Z Miao… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Entity matching is a crucial task in many real applications. Despite the substantial body of
research that focuses on improving the effectiveness of entity matching, enhancing its …

A survey of blocking and filtering techniques for entity resolution

G Papadakis, D Skoutas, E Thanos… - arXiv preprint arXiv …, 2019 - arxiv.org
Efficiency techniques are an integral part of Entity Resolution, since its infancy. In this
survey, we organized the bulk of works in the field into Blocking, Filtering and hybrid …

Synthesizing privacy preserving entity resolution datasets

X Qinl, C Chai, N Tang, J Li, Y Luo… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
Entity resolution (ER) is a core problem in data integration. Many companies have lots of
datasets where ER needs to be conducted to integrate the data. On the one hand, it is …