Discover dependencies from data—a review

J Liu, J Li, C Liu, Y Chen - IEEE Transactions on Knowledge …, 2010 - ieeexplore.ieee.org
Functional and inclusion dependency discovery is important to knowledge discovery,
database semantics analysis, database design, and data quality assessment. Motivated by …

[图书][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Profiling relational data: a survey

Z Abedjan, L Golab, F Naumann - The VLDB Journal, 2015 - Springer
Profiling data to determine metadata about a given dataset is an important and frequent
activity of any IT professional and researcher and is necessary for various use-cases. It …

Functional dependency discovery: An experimental evaluation of seven algorithms

T Papenbrock, J Ehrlich, J Marten, T Neubert… - Proceedings of the …, 2015 - dl.acm.org
Functional dependencies are important metadata used for schema normalization, data
cleansing and many other tasks. The efficient discovery of functional dependencies in tables …

Santos: Relationship-based semantic table union search

A Khatiwada, G Fan, R Shraga, Z Chen… - Proceedings of the …, 2023 - dl.acm.org
Existing techniques for unionable table search define unionability using metadata (tables
must have the same or similar schemas) or column-based metrics (for example, the values …

[PDF][PDF] Subgroup discovery with CN2-SD

N Lavrac, B Kavsek, P Flach, L Todorovski - J. Mach. Learn. Res., 2004 - jmlr.org
This paper investigates how to adapt standard classification rule learning approaches to
subgroup discovery. The goal of subgroup discovery is to find rules describing subsets of the …

Discovering conditional functional dependencies

W Fan, F Geerts, J Li, M Xiong - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
This paper investigates the discovery of conditional functional dependencies (CFDs). CFDs
are a recent extension of functional dependencies (FDs) by supporting patterns of …

Rule evaluation measures: A unifying view

N Lavrač, P Flach, B Zupan - International conference on inductive logic …, 1999 - Springer
Numerous measures are used for performance evaluation in machine learning. In predictive
knowledge discovery, the most frequently used measure is classification accuracy. With new …

A hybrid approach to functional dependency discovery

T Papenbrock, F Naumann - … of the 2016 International Conference on …, 2016 - dl.acm.org
Functional dependencies are structural metadata that can be used for schema
normalization, data integration, data cleansing, and many other data management tasks …

Fauce: fast and accurate deep ensembles with uncertainty for cardinality estimation

J Liu, W Dong, Q Zhou, D Li - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Cardinality estimation is a fundamental and critical problem in databases. Recently, many
estimators based on deep learning have been proposed to solve this problem and they have …