Preventing catastrophic overfitting in fast adversarial training: A bi-level optimization perspective

Z Wang, H Wang, C Tian, Y Jin - European Conference on Computer …, 2024 - Springer
Adversarial training (AT) has become an effective defense method against adversarial
examples (AEs) and it is typically framed as a bi-level optimization problem. Among various …

Adversarial Training: A Survey

M Zhao, L Zhang, J Ye, H Lu, B Yin, X Wang - arXiv preprint arXiv …, 2024 - arxiv.org
Adversarial training (AT) refers to integrating adversarial examples--inputs altered with
imperceptible perturbations that can significantly impact model predictions--into the training …

Rethinking Fast Adversarial Training: A Splitting Technique to Overcome Catastrophic Overfitting

M Zareapoor, P Shamsolmoali - European Conference on Computer …, 2024 - Springer
Catastrophic overfitting (CO) poses a significant challenge to fast adversarial training
(FastAT), particularly at large perturbation scales, leading to dramatic reductions in …

Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

R Lin, C Yu, B Han, H Su, T Liu - arXiv preprint arXiv:2405.16262, 2024 - arxiv.org
Catastrophic overfitting (CO) presents a significant challenge in single-step adversarial
training (AT), manifesting as highly distorted deep neural networks (DNNs) that are …

Adversarial robustness without perturbations

A Rodríguez Muñoz - 2024 - dspace.mit.edu
Models resistant to adversarial perturbations are stable around the neighbourhoods of input
images, such that small changes, known as adversarial attacks, cannot dramatically change …

[PDF][PDF] A Survey on Image Perturbations for Model Robustness: Attacks and Defenses

PF Zhang, Z Huang - researchgate.net
The widespread adoption of deep neural networks (DNNs) has raised significant concerns
about their robustness, particularly in real-world environments characterized by inherent …