Scalable matching and clustering of entities with FAMER

A Saeedi, M Nentwig, E Peukert… - … Systems Informatics and …, 2018 - journals.rtu.lv
Entity resolution identifies semantically equivalent entities, eg describing the same product
or customer. It is especially challenging for Big Data applications where large volumes of …

A big data platform exploiting auditable tokenization to promote good practices inside local energy communities

L Gagliardelli, L Zecchini, L Ferretti… - Future Generation …, 2023 - Elsevier
Abstract The Energy Community Platform (ECP) is a modular system conceived to promote a
conscious use of energy by the users inside local energy communities. It is composed of two …

[PDF][PDF] ECDP: A big data platform for the smart monitoring of local energy communities

L Gagliardelli, L Zecchini, D Beneventano… - CEUR Workshop …, 2022 - iris.unimore.it
In this paper we present the Energy Community Data Platform (ECDP), a middleware
platform designed to support the collection and the analysis of big data about the energy …

Record Linkage Approaches in Big Data: A Comprehensive Review

SF Zahrae, C Ali, A Mohamed - 2024 International Conference …, 2024 - ieeexplore.ieee.org
Analyzing data and making the right decisions have become crucial objectives in various
domains. Record linkage is one of the most important processes for guaranteeing good data …

[PDF][PDF] Entity resolution and data fusion: An integrated approach

D Beneventano, S Bergamaschi… - Proceedings of the …, 2019 - iris.unimore.it
Entity Resolution and Data Fusion are fundamental tasks in a Data Integration process.
Unfortunately, these tasks cannot be completely addressed by purely automated methods …

Progressive entity resolution with node embeddings

G Simonini, L Gagliardelli, M Rinaldi… - CEUR WORKSHOP …, 2022 - iris.unimore.it
Entity Resolution (ER) is the task of finding records that refer to the same real-world entity,
which are called matches. ER is a fundamental pre-processing step when dealing with dirty …

DXP: Billing Data Preparation for Big Data Analytics

L Gagliardelli, D Beneventano, M Esposito… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we present the data preparation activities that we performed for the Digital
Experience Platform (DXP) project, commissioned and supervised by Doxee SpA. DXP …

[PDF][PDF] Improving the efficiency of clustering algorithm for duplicates detection

A Ali, NA Emran, SSK Baharin, Z Othman… - Indonesian Journal of …, 2023 - academia.edu
Clustering method is a technique used for comparisons reduction between the candidates
records in the duplicate detection process. The process of clustering records is affected by …

[PDF][PDF] An Efficient Multi-Phase Blocking Strategy for Entity Resolution in Big Data

RM Abd El-Ghafar, AH El-Bastawissy… - … Journal Of Innovative …, 2020 - researchgate.net
Entity Resolution (ER) is the process of identifying records that refer to the same real-world
entity. It plays a key role in many applications as data warehouse, data integration, and …

[PDF][PDF] The Case for Multi-task Active Learning Entity Resolution

G Simonini, H Saccani, L Gagliardelli… - CEUR WORKSHOP …, 2021 - iris.unimore.it
Entity Resolution (ER) is a multi-task process that aims to detect different records in dirty
datasets that refer to the same real-world entity. It is a building block for any data integration …