Thirdeye: Attention maps for safe autonomous driving systems

A Stocco, PJ Nunes, M d'Amorim… - Proceedings of the 37th …, 2022 - dl.acm.org
Automated online recognition of unexpected conditions is an indispensable component of
autonomous vehicles to ensure safety even in unknown and uncertain situations. In this …

Efficient and effective feature space exploration for testing deep learning systems

T Zohdinasab, V Riccio, A Gambi… - ACM Transactions on …, 2023 - dl.acm.org
Assessing the quality of Deep Learning (DL) systems is crucial, as they are increasingly
adopted in safety-critical domains. Researchers have proposed several input generation …

An empirical study on data distribution-aware test selection for deep learning enhancement

Q Hu, Y Guo, M Cordy, X Xie, L Ma… - ACM Transactions on …, 2022 - dl.acm.org
Similar to traditional software that is constantly under evolution, deep neural networks need
to evolve upon the rapid growth of test data for continuous enhancement (eg, adapting to …

Simulation-based test case generation for unmanned aerial vehicles in the neighborhood of real flights

S Khatiri, S Panichella, P Tonella - 2023 IEEE Conference on …, 2023 - ieeexplore.ieee.org
Unmanned aerial vehicles (UAVs), also known as drones, are acquiring increasing
autonomy. With their commercial adoption, the problem of testing their functional and non …

When and why test generators for deep learning produce invalid inputs: an empirical study

V Riccio, P Tonella - 2023 IEEE/ACM 45th International …, 2023 - ieeexplore.ieee.org
Testing Deep Learning (DL) based systems inherently requires large and representative test
sets to evaluate whether DL systems generalise beyond their training datasets. Diverse Test …

Deepmetis: Augmenting a deep learning test set to increase its mutation score

V Riccio, N Humbatova, G Jahangirova… - 2021 36th IEEE/ACM …, 2021 - ieeexplore.ieee.org
Deep Learning (DL) components are routinely integrated into software systems that need to
perform complex tasks such as image or natural language processing. The adequacy of the …

Astraea: Grammar-Based Fairness Testing

E Soremekun, S Udeshi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Software often produces biased outputs. In particular, machine learning (ML) based software
is known to produce erroneous predictions when processing discriminatory inputs. Such …

Regression fuzzing for deep learning systems

H You, Z Wang, J Chen, S Liu… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Deep learning (DL) Systems have been widely used in various domains. Similar to
traditional software, DL system evolution may also incur regression faults. To find the …

Input distribution coverage: Measuring feature interaction adequacy in neural network testing

S Dola, MB Dwyer, ML Soffa - ACM Transactions on Software …, 2023 - dl.acm.org
Testing deep neural networks (DNNs) has garnered great interest in the recent years due to
their use in many applications. Black-box test adequacy measures are useful for guiding the …

Can Coverage Criteria Guide Failure Discovery for Image Classifiers? An Empirical Study

Z Wang, S Xu, L Fan, X Cai, L Li, Z Liu - ACM Transactions on Software …, 2024 - dl.acm.org
Quality assurance of deep neural networks (DNNs) is crucial for the deployment of DNN-
based software, especially in mission-and safety-critical tasks. Inspired by structural white …