A survey on large language model (LLM) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier
Abstract: Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

A survey of adversarial defenses and robustness in NLP

S Goyal, S Doddapaneni, MM Khapra… - ACM Computing …, 2023 - dl.acm.org
In the past few years, it has become increasingly evident that deep neural networks are not
resilient enough to withstand adversarial perturbations in input data, leaving them …

When foundation model meets federated learning: Motivations, challenges, and future directions

W Zhuang, C Chen, L Lyu - arXiv preprint arXiv:2306.15546, 2023 - arxiv.org
The intersection of the Foundation Model (FM) and Federated Learning (FL) provides mutual
benefits, presents a unique opportunity to unlock new possibilities in AI research, and …

A pathway towards responsible AI generated content

C Chen, J Fu, L Lyu - arXiv preprint arXiv:2303.01325, 2023 - arxiv.org
AI Generated Content (AIGC) has received tremendous attention within the past few years,
with content generated in the form of images, text, audio, video, etc. Meanwhile, AIGC has …

Personal LLM agents: Insights and survey about the capability, efficiency and security

Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …

Backdooring multimodal learning

X Han, Y Wu, Q Zhang, Y Zhou, Y Xu… - … IEEE Symposium on …, 2024 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) are vulnerable to backdoor attacks, which poison the training
set to alter the model prediction over samples with a specific trigger. While existing efforts …

Triggerless backdoor attack for NLP tasks with clean labels

L Gan, J Li, T Zhang, X Li, Y Meng, F Wu… - arXiv preprint arXiv …, 2021 - arxiv.org
Backdoor attacks pose a new threat to NLP models. A standard strategy to construct
poisoned data in backdoor attacks is to insert triggers (e.g., rare words) into selected …

Backdoor attacks and defenses targeting multi-domain AI models: A comprehensive review

S Zhang, Y Pan, Q Liu, Z Yan, KKR Choo… - ACM Computing …, 2024 - dl.acm.org
Since security concerns first emerged in artificial intelligence (AI), significant attention
has been devoted to the study of backdoor attacks. Attackers can utilize …

Backdoor attacks and countermeasures in natural language processing models: A comprehensive security review

P Cheng, Z Wu, W Du, H Zhao, W Lu, G Liu - arXiv preprint arXiv …, 2023 - arxiv.org
Applying third-party data and models has become a new paradigm for language modeling
in NLP, which also introduces potential security vulnerabilities because attackers can …

A survey of backdoor attacks and defenses on large language models: Implications for security measures

S Zhao, M Jia, Z Guo, L Gan, X Xu, X Wu, J Fu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs), which bridge the gap between human language
understanding and complex problem-solving, achieve state-of-the-art performance on …