How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer
Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

Relating adversarially robust generalization to flat minima

D Stutz, M Hein, B Schiele - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Adversarial training (AT) has become the de-facto standard to obtain models robust against
adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on …

On the limitations of dataset balancing: The lost battle against spurious correlations

R Schwartz, G Stanovsky - arXiv preprint arXiv:2204.12708, 2022 - arxiv.org
Recent work has shown that deep learning models in NLP are highly sensitive to low-level
correlations between simple features and specific output labels, leading to overfitting and …

Two coupled rejection metrics can tell adversarial examples apart

T Pang, H Zhang, D He, Y Dong, H Su… - Proceedings of the …, 2022 - openaccess.thecvf.com
Correctly classifying adversarial examples is an essential but challenging requirement for
safely deploying machine learning models. As reported in RobustBench, even the state-of …

Provably robust classification of adversarial examples with detection

F Sheikholeslami, A Lotfi, JZ Kolter - International Conference on …, 2021 - openreview.net
Adversarial attacks against deep networks can be defended against either by building
robust classifiers or, by creating classifiers that can\emph {detect} the presence of …

Improving adversarial robustness via joint classification and multiple explicit detection classes

S Baharlouei, F Sheikholeslami… - International …, 2023 - proceedings.mlr.press
This work concerns the development of deep networks that are certifiably robust to
adversarial attacks. Joint robust classification-detection was recently introduced as a …

Union label smoothing adversarial training: Recognize small perturbation attacks and reject larger perturbation attacks balanced

J Huang, H Xie, C Wu, X Xiang - Future Generation Computer Systems, 2023 - Elsevier
Recently, several adversarial training methods have been proposed for rejecting
perturbation-based adversarial examples, which enhance the robustness of deep neural …

Stratified adversarial robustness with rejection

J Chen, J Raghuram, J Choi, X Wu… - … on machine learning, 2023 - proceedings.mlr.press
Recently, there is an emerging interest in adversarially training a classifier with a rejection
option (also known as a selective classifier) for boosting adversarial robustness. While …

Adversarial training with rectified rejection

T Pang, H Zhang, D He, Y Dong, H Su, W Chen, J Zhu… - 2021 - openreview.net
Adversarial training (AT) is one of the most effective strategies for promoting model
robustness, whereas even the state-of-the-art adversarially trained models struggle to …

Shaping deep feature space towards gaussian mixture for visual classification

W Wan, C Yu, J Chen, T Wu, Y Zhong… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
The softmax cross-entropy loss function has been widely used to train deep models for
various tasks. In this work, we propose a Gaussian mixture (GM) loss function for deep …