Attacks and defenses for generative diffusion models: A comprehensive survey

VT Truong, LB Dang, LB Le - arXiv preprint arXiv:2408.03400, 2024 - arxiv.org
Diffusion models (DMs) have achieved state-of-the-art performance on various generative
tasks such as image synthesis, text-to-image, and text-guided image-to-image generation …

An Overview of Trustworthy AI: Advances in IP Protection, Privacy-preserving Federated Learning, Security Verification, and GAI Safety Alignment

Y Zheng, CH Chang, SH Huang… - IEEE Journal on …, 2024 - ieeexplore.ieee.org
AI has undergone a remarkable evolution journey marked by groundbreaking milestones.
Like any powerful tool, it can be turned into a weapon for devastation in the wrong hands …

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

S Cheng, G Shen, K Zhang, G Tao, S An, H Guo… - … on Computer Vision, 2024 - Springer
Deep neural networks (DNNs) have demonstrated effectiveness in various fields. However,
DNNs are vulnerable to backdoor attacks, which inject a unique pattern, called trigger, into …

Ufid: A unified framework for input-level backdoor detection on diffusion models

Z Guan, M Hu, S Li, A Vullikanti - arXiv preprint arXiv:2404.01101, 2024 - arxiv.org
Diffusion Models are vulnerable to backdoor attacks, where malicious attackers inject
backdoors by poisoning some parts of the training samples during the training stage. This …

Diff-cleanse: Identifying and mitigating backdoor attacks in diffusion models

J Hao, X Jin, H Xiaoguang, C Tianyou… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion models (DMs) are regarded as one of the most advanced generative models today,
yet recent studies suggest that they are vulnerable to backdoor attacks, which establish …

[PDF][PDF] Exploring the Orthogonality and Linearity of Backdoor Attacks

K Zhang, S Cheng, G Shen, G Tao, S An… - … IEEE Symposium on …, 2024 - kaiyuanzhang.com
Backdoor attacks embed an attacker-chosen pattern into inputs to cause model
misclassification. This security threat to machine learning has been a long concern. There …

PureDiffusion: Using Backdoor to Counter Backdoor in Generative Diffusion Models

VT Truong, LB Le - arXiv preprint arXiv:2409.13945, 2024 - arxiv.org
Diffusion models (DMs) are advanced deep learning models that achieved state-of-the-art
capability on a wide range of generative tasks. However, recent studies have shown their …

Defending text-to-image diffusion models: Surprising efficacy of textual perturbations against backdoor attacks

O Chew, PY Lu, J Lin, HT Lin - arXiv preprint arXiv:2408.15721, 2024 - arxiv.org
Text-to-image diffusion models have been widely adopted in real-world applications due to
their ability to generate realistic images from textual descriptions. However, recent studies …

Uncovering Vision Modality Threats in Image-to-Image Tasks

H Cheng, E Xiao, J Yang, J Cao, Q Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Current image generation models can effortlessly produce high-quality, highly realistic
images, but this also increases the risk of misuse. In various Text-to-Image or Image-to …

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

J Vice, N Akhtar, R Hartley, A Mian - arXiv preprint arXiv:2411.13981, 2024 - arxiv.org
The widespread availability of multimodal generative models has sparked critical
discussions on their fairness, reliability, and potential for misuse. While text-to-image models …