A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language...

M Feffer, A Sinha, WH Deng, ZC Lipton… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

In response to rising concerns surrounding the safety, security, and trustworthiness of
Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red …

Cited by 36 · Related articles · All 2 versions

[PDF] arxiv.org

Jailbreak attacks and defenses against large language models: A survey

S Yi, Y Liu, Z Sun, T Cong, X He, J Song, K Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) have performed exceptionally in various text-generative
tasks, including question answering, translation, code completion, etc. However, the over …

Cited by 31 · Related articles · All 4 versions

[PDF] arxiv.org

LLM defenses are not robust to multi-turn human jailbreaks yet

N Li, Z Han, I Steneker, W Primack, R Goodside… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent large language model (LLM) defenses have greatly improved models' ability to
refuse harmful queries, even when adversarially attacked. However, LLM defenses are …

Cited by 21 · Related articles · All 4 versions

[PDF] arxiv.org

JailbreakZoo: Survey, landscapes, and horizons in jailbreaking large language and vision-language models

H Jin, L Hu, X Li, P Zhang, C Chen, J Zhuang… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid evolution of artificial intelligence (AI) through developments in Large Language
Models (LLMs) and Vision-Language Models (VLMs) has brought significant advancements …

Cited by 19 · Related articles · All 3 versions

[PDF] arxiv.org

When LLMs meet cybersecurity: A systematic literature review

J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …

Cited by 34 · Related articles · All 2 versions

[PDF] purdue.edu

On large language models' resilience to coercive interrogation

Z Zhang, G Shen, G Tao, S Cheng… - 2024 IEEE Symposium on …, 2024 - computer.org

Large Language Models (LLMs) are increasingly employed in numerous
applications. It is hence important to ensure that their ethical standard aligns with humans' …

Cited by 20 · Related articles

[PDF] arxiv.org

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

K Huang, F Mo, H Li, Y Li, Y Zhang, W Yi, Y Mao… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …

Cited by 12 · Related articles · All 2 versions

[PDF] arxiv.org

Jailbreaking and mitigation of vulnerabilities in large language models

B Peng, Z Bi, Q Niu, M Liu, P Feng, T Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) have transformed artificial intelligence by advancing
natural language understanding and generation, enabling applications across fields beyond …

Cited by 9 · Related articles · All 5 versions

[PDF] researchgate.net

A Survey on Responsible Generative AI: What to Generate and What Not

J Gu - arXiv preprint arXiv:2404.05783, 2024 - researchgate.net

In recent years, generative AI (GenAI), like large language models and text-to-image
models, has received significant attention across various domains. However, ensuring the …

Cited by 12 · Related articles · All 2 versions

[PDF] arxiv.org

Hallu-PI: Evaluating hallucination in multi-modal large language models within perturbed inputs

P Ding, J Wu, J Kuang, D Ma, X Cao, X Cai… - Proceedings of the …, 2024 - dl.acm.org

Multi-modal Large Language Models (MLLMs) have demonstrated remarkable performance
on various visual-language understanding and generation tasks. However, MLLMs …

Cited by 5 · Related articles · All 5 versions
