Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2023 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Open data: Quality over quantity

S Sadiq, M Indulska - International journal of information management, 2017 - Elsevier
Open data aims to unlock the innovation potential of businesses, governments, and
entrepreneurs, yet it also harbours significant challenges for its effective use. While …

Open data integration

RJ Miller - Proceedings of the VLDB Endowment, 2018 - dl.acm.org
Open data plays a major role in supporting both governmental and organizational
transparency. Many organizations are adopting Open Data Principles promising to make …

DataSynapse: A social data curation foundry

A Beheshti, B Benatallah, A Tabebordbar… - Distributed and Parallel …, 2019 - Springer
Social data analytics have become a vital asset for organizations and governments. For
example, over the last few years, governments started to extract knowledge and derive …

[BOOK][B] Social data analytics

A Beheshti, S Ghodratnama, M Elahi, H Farhood - 2022 - taylorfrancis.com
This book is an introduction to social data analytics along with its challenges and
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …

QATCH: Benchmarking SQL-centric tasks with table representation learning models on your data

S Papicchio, P Papotti… - Advances in Neural …, 2024 - proceedings.neurips.cc
Table Representation Learning (TRL) models are commonly pre-trained on large
open-domain datasets comprising millions of tables and then used to address downstream …

Data quality: The role of empiricism

S Sadiq, T Dasu, XL Dong, J Freire, IF Ilyas… - ACM SIGMOD …, 2018 - dl.acm.org
We outline a call to action for promoting empiricism in data quality research. The action
points result from an analysis of the landscape of data quality research. The landscape …

Towards intelligent feature engineering for risk-based customer segmentation in banking

S Khadivizand, A Beheshti, F Sobhanmanesh… - Proceedings of the 18th …, 2020 - dl.acm.org
Business Processes, i.e., a set of coordinated tasks and activities to achieve a business goal,
and their continuous improvements are key to the operation of any organization. In banking …

Gen-T: Table Reclamation in Data Lakes

G Fan, R Shraga, RJ Miller - arXiv preprint arXiv:2403.14128, 2024 - arxiv.org
We introduce the problem of Table Reclamation. Given a Source Table and a large table
repository, reclamation finds a set of tables that, when integrated, reproduce the source table …

Improving data quality in large-scale repositories through conflict resolution

A Kulmukhametov, A Rauber, C Becker - International Journal on Digital …, 2021 - Springer
Digital repositories rely on technical metadata to manage their objects. The output of
characterization tools is aggregated and analyzed through content profiling. The accuracy …