A survey of adversarial defenses and robustness in nlp

S Goyal, S Doddapaneni, MM Khapra… - ACM Computing …, 2023 - dl.acm.org
In the past few years, it has become increasingly evident that deep neural networks are not
resilient enough to withstand adversarial perturbations in input data, leaving them …

Adversarial attacks on deep-learning models in natural language processing: A survey

WE Zhang, QZ Sheng, A Alhazmi, C Li - ACM Transactions on Intelligent …, 2020 - dl.acm.org
With the development of high computational devices, deep neural networks (DNNs), in
recent years, have gained significant popularity in many Artificial Intelligence (AI) …

Towards a robust deep neural network against adversarial texts: A survey

W Wang, R Wang, L Wang, Z Wang… - ieee transactions on …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) have achieved remarkable success in various tasks (eg,
image classification, speech recognition, and natural language processing (NLP)). However …

Improving the reliability of deep neural networks in NLP: A review

B Alshemali, J Kalita - Knowledge-Based Systems, 2020 - Elsevier
Deep learning models have achieved great success in solving a variety of natural language
processing (NLP) problems. An ever-growing body of research, however, illustrates the …

" That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks

E Mosca, S Agarwal, J Rando, G Groh - arXiv preprint arXiv:2204.04636, 2022 - arxiv.org
Adversarial attacks are a major challenge faced by current machine learning research.
These purposely crafted inputs fool even the most advanced models, precluding their …

Adversarial attack and defense technologies in natural language processing: A survey

S Qiu, Q Liu, S Zhou, W Huang - Neurocomputing, 2022 - Elsevier
Recently, the adversarial attack and defense technology has made remarkable
achievements and has been widely applied in the computer vision field, promoting its rapid …

Adversarial attacks and defenses in images, graphs and text: A review

H Xu, Y Ma, HC Liu, D Deb, H Liu, JL Tang… - International journal of …, 2020 - Springer
Deep neural networks (DNN) have achieved unprecedented success in numerous machine
learning tasks in various domains. However, the existence of adversarial examples raises …

Towards improving adversarial training of NLP models

JY Yoo, Y Qi - arXiv preprint arXiv:2109.00544, 2021 - arxiv.org
Adversarial training, a method for learning robust deep neural networks, constructs
adversarial examples during training. However, recent methods for generating NLP …

Natural language adversarial attack and defense in word level

X Wang, H Jin, K He - 2019 - openreview.net
Up until very recently, inspired by a mass of researches on adversarial examples for
computer vision, there has been a growing interest in designing adversarial attacks for …

Towards defending against adversarial examples via attack-invariant features

D Zhou, T Liu, B Han, N Wang… - … on machine learning, 2021 - proceedings.mlr.press
Deep neural networks (DNNs) are vulnerable to adversarial noise. Their adversarial
robustness can be improved by exploiting adversarial examples. However, given the …