[PDF][PDF] Modeling and Using Biographical Linked Data for Prosopographical Data Analysis

P Leskinen - 2024 - aaltodoc.aalto.fi
Biographical data is used for identifying people, groups, and organizations and for
conveying information about them. Biographical data describes life stories of people with the …

De-identified Bayesian personal identity matching for privacy-preserving record linkage despite errors: development and validation

RN Cardinal, A Moore, M Burchell, JR Lewis - BMC Medical Informatics …, 2023 - Springer
Background Epidemiological research may require linkage of information from multiple
organizations. This can bring two problems:(1) the information governance desirability of …

Augmenting Fact and Date of Death in Electronic Health Records using Internet Media Sources: A Validation Study from Two Large Healthcare Systems

M LeNoue-Newton, MA Al-Garadi, K Ngan, HS Pillai… - medRxiv, 2025 - medrxiv.org
Objective To evaluate the validity of death ascertainment from publicly available internet
media sources by benchmarking against state and Federal vital statics data for patients in …

[HTML][HTML] Generating synthetic identifiers to support development and evaluation of data linkage methods

J Lam, A Boyd, R Linacre… - … Journal of Population …, 2024 - pmc.ncbi.nlm.nih.gov
Introduction Careful development and evaluation of data linkage methods is limited by
researcher access to personal identifiers. One solution is to generate synthetic identifiers …

The Public Utility Data Liberation Project: Providing Open Data For a Clean Energy Transition

K Lamb, E Belfer, Z Selvans, B Norman… - 2024 56th North …, 2024 - ieeexplore.ieee.org
A rapid and equitable energy transition needs a diversity of participants who are empowered
to conduct research and intervene with high-quality accessible energy data. This paper …

Identifying Duplicate Customer Records Using Blocking and Supervised Learning Techniques

A Walker, JA Diaz-Pace… - 2024 IEEE Biennial …, 2024 - ieeexplore.ieee.org
Identifying duplicate entities in databases is crucial for many organizations and often
requires automated processing due to the volume and complexity of records. Blocking …

BeRTo: An Efficient Spark-Based Tool for Linking Business Registries in Big Data Environments

A Colombo, F Invernici - … of the 13th International Conference on …, 2024 - re.public.polimi.it
Linking entities from different datasets is a crucial task for the success of modern
businesses. However, aligning entities becomes challenging as common identifiers might …

Experimental evaluation of record linkage algorithms in a secure banking environment

C Segercrantz - 2023 - aaltodoc.aalto.fi
This thesis studies record linkage algorithms in secure banking environments. Financial
crime prevention laws require financial institutions to identify and monitor high-risk …

How can advancements in the data science space such as vectorization, graph traversals and probabilistic record linkage, enhance entity resolution in multi-source …

F Veips - 2024 - studenttheses.uu.nl
Entity matching is an essential field of study in terms of working with data. Effective
coinciding of the entities with each other can significantly increase the effective output out of …

Deduplikace dat a jejich využití

P Klečanský - 2024 - dk.upce.cz
Diplomová práce se zabývá popisem problematiky deduplikace a spojování záznamu.
Teoretická část zahrnuje celý proces deduplikace, od čištění dat až po klasifikaci. Práce také …