Playing it safe: Adversarial robustness with an abstain option

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer

Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

被引用次数：83 相关文章所有 7 个版本

[PDF] thecvf.com

Relating adversarially robust generalization to flat minima

D Stutz, M Hein, B Schiele - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Adversarial training (AT) has become the de-facto standard to obtain models robust against
adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on …

被引用次数：67 相关文章所有 13 个版本

[PDF] arxiv.org

On the limitations of dataset balancing: The lost battle against spurious correlations

R Schwartz, G Stanovsky - arXiv preprint arXiv:2204.12708, 2022 - arxiv.org

Recent work has shown that deep learning models in NLP are highly sensitive to low-level
correlations between simple features and specific output labels, leading to overfitting and …

被引用次数：25 相关文章所有 6 个版本

[PDF] thecvf.com

Two coupled rejection metrics can tell adversarial examples apart

T Pang, H Zhang, D He, Y Dong, H Su… - Proceedings of the …, 2022 - openaccess.thecvf.com

Correctly classifying adversarial examples is an essential but challenging requirement for
safely deploying machine learning models. As reported in RobustBench, even the state-of …

被引用次数：21 相关文章所有 7 个版本

[PDF] openreview.net

Provably robust classification of adversarial examples with detection

F Sheikholeslami, A Lotfi, JZ Kolter - International Conference on …, 2021 - openreview.net

Adversarial attacks against deep networks can be defended against either by building
robust classifiers or, by creating classifiers that can\emph {detect} the presence of …

被引用次数：30 相关文章所有 2 个版本

[PDF] mlr.press

Improving adversarial robustness via joint classification and multiple explicit detection classes

S Baharlouei, F Sheikholeslami… - International …, 2023 - proceedings.mlr.press

This work concerns the development of deep networks that are certifiably robust to
adversarial attacks. Joint robust classification-detection was recently introduced as a …

被引用次数：6 相关文章所有 7 个版本

Union label smoothing adversarial training: Recognize small perturbation attacks and reject larger perturbation attacks balanced

J Huang, H Xie, C Wu, X Xiang - Future Generation Computer Systems, 2023 - Elsevier

Recently, several adversarial training methods have been proposed for rejecting
perturbation-based adversarial examples, which enhance the robustness of deep neural …

被引用次数：1 相关文章所有 2 个版本

[PDF] mlr.press

Stratified adversarial robustness with rejection

J Chen, J Raghuram, J Choi, X Wu… - … on machine learning, 2023 - proceedings.mlr.press

Recently, there is an emerging interest in adversarially training a classifier with a rejection
option (also known as a selective classifier) for boosting adversarial robustness. While …

被引用次数：3 相关文章所有 8 个版本

[PDF] openreview.net

Adversarial training with rectified rejection

T Pang, H Zhang, D He, Y Dong, H Su, W Chen, J Zhu… - 2021 - openreview.net

Adversarial training (AT) is one of the most effective strategies for promoting model
robustness, whereas even the state-of-the-art adversarially trained models struggle to …

被引用次数：13 相关文章

[PDF] arxiv.org

Shaping deep feature space towards gaussian mixture for visual classification

W Wan, C Yu, J Chen, T Wu, Y Zhong… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

The softmax cross-entropy loss function has been widely used to train deep models for
various tasks. In this work, we propose a Gaussian mixture (GM) loss function for deep …

被引用次数：9 相关文章所有 10 个版本

高级搜索

QQ 群