Data cleaning and machine learning: a systematic literature review

PO Côté, A Nikanjam, N Ahmed, D Humeniuk… - Automated Software …, 2024 - Springer
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …

Tabreformer: unsupervised representation learning for erroneous data detection

M Nashaat, A Ghosh, J Miller, S Quader - ACM/IMS Transactions on Data …, 2021 - dl.acm.org
Error detection is a crucial preliminary phase in any data analytics pipeline. Existing error
detection techniques typically target specific types of errors. Moreover, most of these …