Attacks which do not kill training make adversarial learning stronger

T Bai, J Luo, J Zhao, B Wen, Q Wang - arXiv preprint arXiv:2102.01356, 2021 - arxiv.org

Adversarial training is one of the most effective approaches defending against adversarial
examples for deep learning models. Unlike other defense strategies, adversarial training …

被引用次数：417 相关文章所有 6 个版本

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

被引用次数：104 相关文章所有 3 个版本

[PDF] mlr.press

Better diffusion models further improve adversarial training

Z Wang, T Pang, C Du, M Lin… - … on Machine Learning, 2023 - proceedings.mlr.press

It has been recognized that the data generated by the denoising diffusion probabilistic
model (DDPM) improves adversarial training. After two years of rapid development in …

被引用次数：144 相关文章所有 9 个版本

[PDF] mlr.press

Cross-entropy loss functions: Theoretical analysis and applications

A Mao, M Mohri, Y Zhong - International conference on …, 2023 - proceedings.mlr.press

Cross-entropy is a widely used loss function in applications. It coincides with the logistic loss
applied to the outputs of a neural network, when the softmax is used. But, what guarantees …

被引用次数：142 相关文章所有 7 个版本

[PDF] arxiv.org

Robustbench: a standardized adversarial robustness benchmark

F Croce, M Andriushchenko, V Sehwag… - arXiv preprint arXiv …, 2020 - arxiv.org

As a research community, we are still lacking a systematic understanding of the progress on
adversarial robustness which often makes it hard to identify the most promising ideas in …

被引用次数：641 相关文章所有 13 个版本

[PDF] thecvf.com

LAS-AT: adversarial training with learnable attack strategy

X Jia, Y Zhang, B Wu, K Ma… - Proceedings of the …, 2022 - openaccess.thecvf.com

Adversarial training (AT) is always formulated as a minimax problem, of which the
performance depends on the inner optimization that involves the generation of adversarial …

被引用次数：136 相关文章所有 5 个版本

[PDF] thecvf.com

On the robustness of vision transformers to adversarial examples

K Mahmood, R Mahmood… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Recent advances in attention-based networks have shown that Vision Transformers can
achieve state-of-the-art or near state-of-the-art results on many image classification tasks …

被引用次数：221 相关文章所有 10 个版本

[PDF] neurips.cc

Robust pre-training by adversarial contrastive learning

Z Jiang, T Chen, T Chen… - Advances in neural …, 2020 - proceedings.neurips.cc

Recent work has shown that, when integrated with adversarial training, self-supervised pre-
training can lead to state-of-the-art robustness In this work, we improve robustness-aware …

被引用次数：219 相关文章所有 7 个版本

[PDF] neurips.cc

Augmax: Adversarial composition of random augmentations for robust training

H Wang, C Xiao, J Kossaifi, Z Yu… - Advances in neural …, 2021 - proceedings.neurips.cc

Data augmentation is a simple yet effective way to improve the robustness of deep neural
networks (DNNs). Diversity and hardness are two complementary dimensions of data …

被引用次数：106 相关文章所有 10 个版本

[PDF] neurips.cc

Exploring architectural ingredients of adversarially robust deep neural networks

H Huang, Y Wang, S Erfani, Q Gu… - Advances in Neural …, 2021 - proceedings.neurips.cc

Deep neural networks (DNNs) are known to be vulnerable to adversarial attacks. A range of
defense methods have been proposed to train adversarially robust DNNs, among which …

被引用次数：103 相关文章所有 6 个版本

高级搜索

QQ 群