Re-thinking data strategy and integration for artificial intelligence: concepts, opportunities, and challenges

A Aldoseri, KN Al-Khalifa, AM Hamouda - Applied Sciences, 2023 - mdpi.com
The use of artificial intelligence (AI) is becoming more prevalent across industries such as
healthcare, finance, and transportation. Artificial intelligence is based on the analysis of …

Data-centric AI: Perspectives and challenges

D Zha, ZP Bhat, KH Lai, F Yang, X Hu - Proceedings of the 2023 SIAM …, 2023 - SIAM
The role of data in building AI systems has recently been significantly magnified by the
emerging concept of data-centric AI (DCAI), which advocates a fundamental shift from model …

Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Consistency regularization training for compositional generalization

Y Yin, J Zeng, Y Li, F Meng, J Zhou… - Proceedings of the 61st …, 2023 - aclanthology.org
Existing neural models have difficulty generalizing to unseen combinations of seen
components. To achieve compositional generalization, models are required to consistently …

Make-An-Audio 2: Temporal-enhanced text-to-audio generation

J Huang, Y Ren, R Huang, D Yang, Z Ye… - arXiv preprint arXiv …, 2023 - arxiv.org
Large diffusion models have been successful in text-to-audio (T2A) synthesis tasks, but they
often suffer from common issues such as semantic misalignment and poor temporal …

Data-centric graph learning: A survey

C Yang, D Bo, J Liu, Y Peng, B Chen, H Dai… - arXiv preprint arXiv …, 2023 - arxiv.org
The history of artificial intelligence (AI) has witnessed the significant impact of high-quality
data on various deep learning models, such as ImageNet for AlexNet and ResNet. Recently …

Data-centric green artificial intelligence: A survey

S Salehi, A Schmeink - IEEE Transactions on Artificial …, 2023 - ieeexplore.ieee.org
With the exponential growth of computational power and the availability of large-scale
datasets in recent years, remarkable advancements have been made in the field of artificial …

Sanitizing data for analysis: Designing systems for data understanding

J Holstein, M Schemmer, J Jakubik, M Vössing… - Electronic Markets, 2023 - Springer
As organizations accumulate vast amounts of data for analysis, a significant challenge
remains in fully understanding these datasets to extract accurate information and generate …

ydata-profiling: Accelerating data-centric AI with high-quality data

F Clemente, GM Ribeiro, A Quemy, MS Santos… - Neurocomputing, 2023 - Elsevier
ydata-profiling is an open-source Python package for advanced exploratory data
analysis that enables users to generate data profiling reports in a simple, fast, and efficient …

From concept to implementation: the data-centric development process for AI in industry

PP Luley, JM Deriu, P Yan, GA Schatte… - 2023 10th IEEE …, 2023 - ieeexplore.ieee.org
We examine the paradigm of data-centric artificial intelligence (DCAI) as a solution to the
obstacles that small and medium-sized enterprises (SMEs) face in adopting AI. While the …