Data cleaning and machine learning: a systematic literature review

PO Côté, A Nikanjam, N Ahmed, D Humeniuk… - Automated Software …, 2024 - Springer
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …

[HTML][HTML] Terrain detection and segmentation for autonomous vehicle navigation: A state-of-the-art systematic review

MM Kabir, JR Jim, Z Istenes - Information Fusion, 2025 - Elsevier
This review comprehensively investigates the current state and emerging trends of
autonomous vehicle terrain detection and segmentation. By systematically reviewing …

Identifying incorrect annotations in multi-label classification data

A Thyagarajan, E Snorrason, C Northcutt… - arXiv preprint arXiv …, 2022 - arxiv.org
In multi-label classification, each example in a dataset may be annotated as belonging to
one or more classes (or none of the classes). Example applications include image (or …

A Novel Benchmark for Refinement of Noisy Localization Labels in Autolabeled Datasets for Object Detection

A Bär, J Uhrig, JP Umesh, M Cordts… - Proceedings of the …, 2023 - openaccess.thecvf.com
Autolabeling approaches are attractive wrt time and cost as they allow fast annotation
without human intervention. However, can we really trust the label quality of autolabeling …

Semi-supervised domain adaptation with CycleGAN guided by a downstream task loss

A Mütze, M Rottmann, H Gottschalk - arXiv preprint arXiv:2208.08815, 2022 - arxiv.org
Domain adaptation is of huge interest as labeling is an expensive and error-prone task,
especially when labels are needed on pixel-level like in semantic segmentation. Therefore …

Identifying label errors in object detection datasets by loss inspection

M Schubert, T Riedlinger, K Kahl… - Proceedings of the …, 2024 - openaccess.thecvf.com
Labeling datasets for supervised object detection is a dull and time-consuming task. Errors
can be easily introduced during annotation and overlooked during review, yielding …

Active Label Correction for Semantic Segmentation with Foundation Models

H Kim, S Hwang, S Kwak, J Ok - arXiv preprint arXiv:2403.10820, 2024 - arxiv.org
Training and validating models for semantic segmentation require datasets with pixel-wise
annotations, which are notoriously labor-intensive. Although useful priors such as …

[PDF][PDF] Semi-Supervised Domain Adaptation with CycleGAN Guided by Downstream Task Awareness.

A Mütze, M Rottmann, H Gottschalk - VISIGRAPP (5: VISAPP), 2023 - scitepress.org
Domain adaptation is of huge interest as labeling is an expensive and error-prone task,
especially on pixellevel like for semantic segmentation. Therefore, one would like to train …

Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks

L Fieback, B Dash, J Spiegelberg… - International Workshop on …, 2023 - Springer
Quantifying predictive uncertainty of deep semantic segmentation networks is essential in
safety-critical tasks. In applications like autonomous driving, where video data is available …

Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks

E Heinert, S Tilgner, T Palm, M Rottmann - arXiv preprint arXiv …, 2024 - arxiv.org
When employing deep neural networks (DNNs) for semantic segmentation in safety-critical
applications like automotive perception or medical imaging, it is important to estimate their …