A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

[HTML][HTML] Testing machine learning based systems: a systematic mapping

V Riccio, G Jahangirova, A Stocco… - Empirical Software …, 2020 - Springer
Abstract Context: A Machine Learning based System (MLS) is a software system including
one or more components that learn how to perform a task from a given data set. The …

[HTML][HTML] A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

Machine learning testing: Survey, landscapes and horizons

JM Zhang, M Harman, L Ma… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …

Software engineering for AI-based systems: a survey

S Martínez-Fernández, J Bogner, X Franch… - ACM Transactions on …, 2022 - dl.acm.org
AI-based systems are software systems with functionalities enabled by at least one AI
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …

Deephunter: a coverage-guided fuzz testing framework for deep neural networks

X Xie, L Ma, F Juefei-Xu, M Xue, H Chen, Y Liu… - Proceedings of the 28th …, 2019 - dl.acm.org
The past decade has seen the great potential of applying deep neural network (DNN) based
software to safety-critical scenarios, such as autonomous driving. Similar to traditional …

Assuring the machine learning lifecycle: Desiderata, methods, and challenges

R Ashmore, R Calinescu, C Paterson - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Machine learning has evolved into an enabling technology for a wide range of highly
successful applications. The potential for this success to continue and accelerate has placed …

Tensorfuzz: Debugging neural networks with coverage-guided fuzzing

A Odena, C Olsson, D Andersen… - … on Machine Learning, 2019 - proceedings.mlr.press
Neural networks are difficult to interpret and debug. We introduce testing techniques for
neural networks that can discover errors occurring only for rare inputs. Specifically, we …

A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …

Testing deep neural networks

Y Sun, X Huang, D Kroening, J Sharp, M Hill… - arXiv preprint arXiv …, 2018 - arxiv.org
Deep neural networks (DNNs) have a wide range of applications, and software employing
them must be thoroughly tested, especially in safety-critical domains. However, traditional …