Defending pre-trained language models from adversarial word substitutions without performance sacrifice

R Bao, J Wang, H Zhao - arXiv preprint arXiv:2105.14553, 2021 - arxiv.org
Pre-trained contextualized language models (PrLMs) have led to strong performance gains
in downstream natural language understanding tasks. However, PrLMs can still be easily …

Rethinking textual adversarial defense for pre-trained language models

J Wang, R Bao, Z Zhang, H Zhao - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Although pre-trained language models (PrLMs) have achieved significant success, recent
studies demonstrate that PrLMs are vulnerable to adversarial attacks. By generating …

Adversarial GLUE: A multi-task benchmark for robustness evaluation of language models

B Wang, C Xu, S Wang, Z Gan, Y Cheng, J Gao… - arXiv preprint arXiv …, 2021 - arxiv.org
Large-scale pre-trained language models have achieved tremendous success across a
wide range of natural language understanding (NLU) tasks, even surpassing human …

Defense against adversarial attacks in NLP via Dirichlet neighborhood ensemble

Y Zhou, X Zheng, CJ Hsieh, K Chang… - arXiv preprint arXiv …, 2020 - arxiv.org
Although neural networks have achieved prominent performance on many natural language
processing (NLP) tasks, they are vulnerable to adversarial examples. In this paper, we …

Better robustness by more coverage: Adversarial training with mixup augmentation for robust fine-tuning

C Si, Z Zhang, F Qi, Z Liu, Y Wang, Q Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Pretrained language models (PLMs) perform poorly under adversarial attacks. To improve
the adversarial robustness, adversarial data augmentation (ADA) has been widely adopted …

RMLM: A flexible defense framework for proactively mitigating word-level adversarial attacks

Z Wang, Z Liu, X Zheng, Q Su… - Proceedings of the 61st …, 2023 - aclanthology.org
Adversarial attacks on deep neural networks continue to raise security concerns in natural
language processing research. Existing defenses focus on improving the robustness of the …

Towards Semantics- and Domain-Aware Adversarial Attacks

J Zhang, YC Huang, W Wu, MR Lyu - IJCAI, 2023 - ijcai.org
Language models are known to be vulnerable to textual adversarial attacks, which add
human-imperceptible perturbations to the input to mislead DNNs. It is thus imperative to …

Phrase-level textual adversarial attack with label preservation

Y Lei, Y Cao, D Li, T Zhou, M Fang… - arXiv preprint arXiv …, 2022 - arxiv.org
Generating high-quality textual adversarial examples is critical for investigating the pitfalls of
natural language processing (NLP) models and further promoting their robustness. Existing …

Searching for an effective defender: Benchmarking defense against adversarial word substitution

Z Li, J Xu, J Zeng, L Li, X Zheng, Q Zhang… - arXiv preprint arXiv …, 2021 - arxiv.org
Recent studies have shown that deep neural networks are vulnerable to intentionally crafted
adversarial examples, and various methods have been proposed to defend against …

Defense against synonym substitution-based adversarial attacks via Dirichlet neighborhood ensemble

Y Zhou, X Zheng, CJ Hsieh, KW Chang… - Association for …, 2021 - par.nsf.gov
Although deep neural networks have achieved prominent performance on many NLP tasks,
they are vulnerable to adversarial examples. We propose Dirichlet Neighborhood Ensemble …