Testing machine learning based systems: a systematic mapping

V Riccio, G Jahangirova, A Stocco… - Empirical Software …, 2020 - Springer
Abstract Context: A Machine Learning based System (MLS) is a software system including
one or more components that learn how to perform a task from a given data set. The …

A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …

Machine learning testing: Survey, landscapes and horizons

JM Zhang, M Harman, L Ma… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …

Taxonomy of machine learning safety: A survey and primer

S Mohseni, H Wang, C Xiao, Z Yu, Z Wang… - ACM Computing …, 2022 - dl.acm.org
The open-world deployment of Machine Learning (ML) algorithms in safety-critical
applications such as autonomous vehicles needs to address a variety of ML vulnerabilities …

[PDF][PDF] Practical machine learning safety: A survey and primer

S Mohseni, H Wang, Z Yu, C Xiao… - arXiv preprint arXiv …, 2021 - researchgate.net
Authors' addresses: Sina Mohseni, smohseni@ nvidia. com; Zhiding Yu, zhidingy@ nvidia.
com; Chaowei Xiao, chaoweix@ nvidia. com; Jay Yadawa, jyadawa@ nvidia. com, NVIDIA …

Crossasr: Efficient differential testing of automatic speech recognition via text-to-speech

MH Asyrofi, F Thung, D Lo… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Automatic speech recognition (ASR) systems are ubiquitous parts of modern life. It can be
found in our smartphones, desktops, and smart home systems. To ensure its correctness in …

ATOM: Automated Black-Box Testing of Multi-Label Image Classification Systems

S Hu, H Wu, P Wang, J Chang, Y Tu… - 2023 38th IEEE/ACM …, 2023 - ieeexplore.ieee.org
Multi-label Image Classification Systems (MICSs) developed based on Deep Neural
Networks (DNNs) are extensively used in people's daily life. Currently, although there are a …

Quality assurance strategies for machine learning applications in big data analytics: an overview

M Ogrizović, D Drašković, D Bojić - Journal of Big Data, 2024 - Springer
Abstract Machine learning (ML) models have gained significant attention in a variety of
applications, from computer vision to natural language processing, and are almost always …

[HTML][HTML] Assessing operational accuracy of cnn-based image classifiers using an oracle surrogate

A Guerriero, MR Lyu, R Pietrantuono… - Intelligent Systems with …, 2023 - Elsevier
Context Assessing the accuracy in operation of a Machine Learning (ML) system for image
classification on arbitrary (unlabeled) inputs is hard. This is due to the oracle problem, which …

A systematic mapping study on testing of machine learning programs

S Sherin, MZ Iqbal - arXiv preprint arXiv:1907.09427, 2019 - arxiv.org
We aim to conduct a systematic mapping in the area of testing ML programs. We identify,
analyze and classify the existing literature to provide an overview of the area. We followed …