Interactive and deterministic data cleaning

J He, E Veltri, D Santoro, G Li, G Mecca… - Proceedings of the …, 2016 - dl.acm.org
We present Falcon, an interactive, deterministic, and declarative data cleaning system,
which uses SQL update queries as the language to repair data. Falcon does not rely on the …

Query-oriented data cleaning with oracles

M Bergman, T Milo, S Novgorodov… - Proceedings of the 2015 …, 2015 - dl.acm.org
As key decisions are often made based on information contained in a database, it is
important for the database to be as complete and correct as possible. For this reason, many …

KATARA: Reliable data cleaning with knowledge bases and crowdsourcing

X Chu, J Morcos, IF Ilyas, M Ouzzani, P Papotti… - Proceedings of the …, 2015 - dl.acm.org
Data cleaning with guaranteed reliability is hard to achieve without accessing external
sources, since the truth is not necessarily discoverable from the data at hand. Furthermore …

Cleaning data with llunatic

F Geerts, G Mecca, P Papotti, D Santoro - The VLDB Journal, 2020 - Springer
Data cleaning (or data repairing) is considered a crucial problem in many database-related
tasks. It consists in making a database consistent with respect to a given set of constraints. In …

NADEEF: a commodity data cleaning system

M Dallachiesa, A Ebaid, A Eldawy… - Proceedings of the …, 2013 - dl.acm.org
Despite the increasing importance of data quality and the rich theoretical and practical
contributions in all aspects of data cleaning, there is no single end-to-end off-the-shelf …

Descriptive and prescriptive data cleaning

A Chalamalla, IF Ilyas, M Ouzzani… - Proceedings of the 2014 …, 2014 - dl.acm.org
Data cleaning techniques usually rely on some quality rules to identify violating tuples, and
then fix these violations using some repair algorithms. Oftentimes, the rules, which are …

Messing up with BART: error generation for evaluating data-cleaning algorithms

PC Arocena, B Glavic, G Mecca, RJ Miller… - Proceedings of the …, 2015 - dl.acm.org
We study the problem of introducing errors into clean databases for the purpose of
benchmarking data-cleaning algorithms. Our goal is to provide users with the highest …

Bigdansing: A system for big data cleansing

Z Khayyat, IF Ilyas, A Jindal, S Madden… - Proceedings of the …, 2015 - dl.acm.org
Data cleansing approaches have usually focused on detecting and fixing errors with little
attention to scaling to big datasets. This presents a serious impediment since data cleansing …

An extensible framework for data cleaning

H Galhardas, D Florescu, D Shasha, E Simon - 1999 - inria.hal.science
Data integration solutions dealing with large amounts of data have been strongly required in
the last few years. Besides the traditional data integration problems (eg schema integration …

QOCO: A query oriented data cleaning system with oracles

M Bergman, T Milo, S Novgorodov… - Proceedings of the VLDB …, 2015 - dl.acm.org
As key decisions are often made based on information contained in a database, it is
important for the database to be as complete and correct as possible. For this reason, many …