Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2023 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Data lake management: challenges and opportunities

F Nargesian, E Zhu, RJ Miller, KQ Pu… - Proceedings of the VLDB …, 2019 - dl.acm.org
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …

Data collection and quality challenges in deep learning: A data-centric ai perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

How large language models will disrupt data management

RC Fernandez, AJ Elmore, MJ Franklin… - Proceedings of the …, 2023 - dl.acm.org
Large language models (LLMs), such as GPT-4, are revolutionizing software's ability to
understand, process, and synthesize language. The authors of this paper believe that this …

[HTML][HTML] Dataset search: a survey

A Chapman, E Simperl, L Koesten, G Konstantinidis… - The VLDB Journal, 2020 - Springer
Generating value from data requires the ability to find, access and make sense of datasets.
There are many efforts underway to encourage data sharing and reuse, from scientific …

Dataset discovery in data lakes

A Bogatu, AAA Fernandes, NW Paton… - 2020 ieee 36th …, 2020 - ieeexplore.ieee.org
Data analytics stands to benefit from the increasing availability of datasets that are held
without their conceptual relationships being explicitly known. When collected, these datasets …

AI meets database: AI4DB and DB4AI

G Li, X Zhou, L Cao - Proceedings of the 2021 International Conference …, 2021 - dl.acm.org
Database and Artificial Intelligence (AI) can benefit from each other. On one hand, AI can
make database more intelligent (AI4DB). For example, traditional empirical database …

Data market platforms: Trading data assets to solve data problems

RC Fernandez, P Subramaniam… - arXiv preprint arXiv …, 2020 - arxiv.org
Data only generates value for a few organizations with expertise and resources to make
data shareable, discoverable, and easy to integrate. Sharing data that is easy to discover …

Database meets artificial intelligence: A survey

X Zhou, C Chai, G Li, J Sun - IEEE Transactions on Knowledge …, 2020 - ieeexplore.ieee.org
Database and Artificial Intelligence (AI) can benefit from each other. On one hand, AI can
make database more intelligent (AI4DB). For example, traditional empirical database …