[HTML][HTML] A semi-automatic data integration process of heterogeneous databases

M Barbella, G Tortora - Pattern Recognition Letters, 2023 - Elsevier
One of the most difficult issues today, is the integration of data from various sources. Thus, it
arises the need of automatic Data Integration (DI) methods. However, in the literature there …

It's ai match: A two-step approach for schema matching using embeddings

B Hättasch, M Truong-Ngoc, A Schmidt… - arXiv preprint arXiv …, 2022 - arxiv.org
Since data is often stored in different sources, it needs to be integrated to gather a global
view that is required in order to create value and derive knowledge from it. A critical step in …

[PDF][PDF] A survey of schema matching research using database schemas and instances

AA Alwan, A Nordin, M Alzeber… - … Journal of Advanced …, 2017 - pdfs.semanticscholar.org
Schema matching is considered as one of the essential phases of data integration in
database systems. The main aim of the schema matching process is to identify the …

A lightweight approach to extract interschema properties from structured, semi-structured and unstructured sources in a big data scenario

F Cauteruccio, PL Giudice, L Musarella… - … Journal of Information …, 2020 - World Scientific
The knowledge of interschema properties (eg, synonymies, homonymies, hyponymies and
subschema similarities) plays a key role for allowing decision-making in sources …

Semantic schema matching for string attribute with word vectors

K Nozaki, T Hochin, H Nomiya - 2019 6th International …, 2019 - ieeexplore.ieee.org
Instance based schema matching is to determine the correspondences between
heterogeneous databases by comparing instances. This process is especially effective …

Semantic schema matching for string attribute with word vectors and its evaluation

K Nozaki, T Hochin, H Nomiya - International Journal of Networked and …, 2019 - Springer
Instance-based schema matching is to determine the correspondences between
heterogeneous databases by comparing instances. Heterogeneous databases consist of an …

[HTML][HTML] Evaluation of Identification Method of Corresponding Numerical Attributes in Heterogeneous Databases Based on Instances

K Nozaki, T Hochin - International Journal of Networked and Distributed …, 2023 - Springer
This paper experimentally evaluates the instance-based schema matching method for the
attributes storing numerical data. This method uses data distributions and correlations …

[图书][B] Exploring Manual Correction as a Source of User Feedback in Pay-As-You-Go Integration

NAA Azuan - 2021 - search.proquest.com
Current practice in data integration typically requires extensive upfront effort, such as
defining a schema mapping before a useful result can be produced. Dataspace is …

Identification of Corresponding Numerical Attributes in Heterogeneous Databases Based on Instances

K Nozaki, T Hochin, H Nomiya - Proceedings of the the 8th International …, 2021 - dl.acm.org
Identification of attributes in heterogeneous databases is widely known as the schema
matching problem, and many studies have been published. Although existing studies using …

Entity Resolution Algorithm for Heterogeneous Data Sources

K Cao, H Liu - … on Computer Information Science and Artificial …, 2021 - ieeexplore.ieee.org
Entity resolution is an important step in data cleaning and data integration, which can
identify records from different data sources that refer to the same entity. For the problem of …