A survey of adversarial defenses and robustness in nlp

S Goyal, S Doddapaneni, MM Khapra… - ACM Computing …, 2023 - dl.acm.org
In the past few years, it has become increasingly evident that deep neural networks are not
resilient enough to withstand adversarial perturbations in input data, leaving them …

Adversarial attacks and defenses in machine learning-empowered communication systems and networks: A contemporary survey

Y Wang, T Sun, S Li, X Yuan, W Ni… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
Adversarial attacks and defenses in machine learning and deep neural network (DNN) have
been gaining significant attention due to the rapidly growing applications of deep learning in …

Probabilistic categorical adversarial attack and adversarial training

H Xu, P He, J Ren, Y Wan, Z Liu… - … on Machine Learning, 2023 - proceedings.mlr.press
The studies on adversarial attacks and defenses have greatly improved the robustness of
Deep Neural Networks (DNNs). Most advanced approaches have been overwhelmingly …

Reducing sentiment bias in pre-trained sentiment classification via adaptive gumbel attack

J Tian, S Chen, X Zhang, X Wang, Z Feng - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Pre-trained language models (PLMs) have recently enabled rapid progress on sentiment
classification under the pre-train and fine-tune paradigm, where the fine-tuning phase aims …

Adversarial nlp for social network applications: Attacks, defenses, and research directions

I Alsmadi, K Ahmad, M Nazzal, F Alam… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
The growing use of media has led to the development of several machine learning (ML) and
natural language processing (NLP) tools to process the unprecedented amount of social …

[PDF][PDF] A survey in adversarial defences and robustness in nlp

G Shreya, MM Khapra - arXiv preprint arXiv:2203.06414, 2022 - researchgate.net
Authors' addresses: Shreya Goyal, Robert Bosch Centre for Data Science and AI, Indian
Institute of Technology Madras, Bhupat and Jyoti Mehta School of Biosciences,, Chennai …

Finest: Stabilizing recommendations by rank-preserving fine-tuning

S Oh, B Ustun, J McAuley, S Kumar - arXiv preprint arXiv:2402.03481, 2024 - arxiv.org
Modern recommender systems may output considerably different recommendations due to
small perturbations in the training data. Changes in the data from a single user will alter the …

Robustness of models addressing Information Disorder: A comprehensive review and benchmarking study

G Fenza, V Loia, C Stanzione, M Di Gisi - Neurocomputing, 2024 - Elsevier
Abstract Machine learning and deep learning models are increasingly susceptible to
adversarial attacks, particularly in critical areas like cybersecurity and Information Disorder …

Geometrically-aggregated training samples: Leveraging summary statistics to enable healthcare data democratization

J Yang, A Thakur, AAS Soltan, DA Clifton - medRxiv, 2023 - medrxiv.org
Healthcare data is highly sensitive and confidential, with strict regulations and laws to
protect patient privacy and security. However, these regulations impede the access of …

MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification

H Gietz, J Kalita - arXiv preprint arXiv:2406.13066, 2024 - arxiv.org
The improvement of language model robustness, including successful defense against
adversarial attacks, remains an open problem. In computer vision settings, the stochastic …