R Wang, Y Li, J Wang - 2023 IEEE 39th International …, 2023 - ieeexplore.ieee.org
Machine learning (ML) is playing an increasingly important role in data management tasks, particularly in Data Integration and Preparation (DI&P). The success of ML-based …
Currently, data are often referred to as the oil of the 21st century. This comparison is not only used to express that the resource data are just as important for the fourth industrial …
Recent work has made significant progress in helping users to automate single data preparation steps, such as string-transformations and table-manipulation operators (eg …
Software systems that learn from data with machine learning (ML) are used in critical decision-making processes. Unfortunately, real-world experience shows that the pipelines …
X Ding, G Li, H Wang, C Wang… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Time series data generated by thousands of sensors are suffering data quality problems. Traditional constraint-based techniques have greatly contributed to data cleaning …
String data is common in real-world datasets: 67.6% of values in a sample of 1.8 million real Excel spreadsheets from the web were represented as text. Systems that successfully clean …
C Sancricca, G Siracusa, C Cappiello - Journal of Intelligent Information …, 2024 - Springer
Data play a key role in AI systems that support decision-making processes. Data-centric AI highlights the importance of having high-quality input data to obtain reliable results …
M Yan, Y Wang, Y Wang, X Miao, J Li - … of the ACM on Management of …, 2024 - dl.acm.org
Data quality is critical across many applications. The utility of data is undermined by various errors, making rigorous data cleaning a necessity. Traditional data cleaning systems depend …
Data preparation is the process of normalizing, cleaning, transforming, and combining data prior to processing or analysis. It is crucial for obtaining valuable results from data analysis …