“real attackers don't compute gradients”: bridging the gap between adversarial ml research...

E Shayegani, MAA Mamun, Y Fu, P Zaree… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) are swiftly advancing in architecture and capability, and as
they integrate more deeply into complex systems, the urgency to scrutinize their security …

被引用次数：106 相关文章所有 2 个版本

[PDF] forth.gr

Defense strategies for adversarial machine learning: A survey

P Bountakas, A Zarras, A Lekidis, C Xenakis - Computer Science Review, 2023 - Elsevier

Abstract Adversarial Machine Learning (AML) is a recently introduced technique, aiming to
deceive Machine Learning (ML) models by providing falsified inputs to render those models …

被引用次数：27 相关文章所有 3 个版本

[PDF] arxiv.org

Not what you've signed up for: Compromising real-world llm-integrated applications with indirect prompt injection

K Greshake, S Abdelnabi, S Mishra, C Endres… - Proceedings of the 16th …, 2023 - dl.acm.org

Large Language Models (LLMs) are increasingly being integrated into applications, with
versatile functionalities that can be easily modulated via natural language prompts. So far, it …

被引用次数：326 相关文章所有 3 个版本

[PDF] arxiv.org

Prompt Injection attack against LLM-integrated Applications

Y Liu, G Deng, Y Li, K Wang, Z Wang, X Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs), renowned for their superior proficiency in language
comprehension and generation, stimulate a vibrant ecosystem of applications around them …

被引用次数：240 相关文章所有 2 个版本

[PDF] arxiv.org

Jailbreaker: Automated jailbreak across multiple large language model chatbots

G Deng, Y Liu, Y Li, K Wang, Y Zhang, Z Li… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have revolutionized Artificial Intelligence (AI) services due
to their exceptional proficiency in understanding and generating human-like text. LLM …

被引用次数：195 相关文章所有 2 个版本

[PDF] ciso2ciso.com

[PDF][PDF] Tree of attacks: Jailbreaking black-box llms automatically

A Mehrotra, M Zampetakis, P Kassianik… - arXiv preprint arXiv …, 2023 - ciso2ciso.com

Abstract While Large Language Models (LLMs) display versatile functionality, they continue
to generate harmful, biased, and toxic content, as demonstrated by the prevalence of …

被引用次数：132 相关文章所有 3 个版本

[PDF] acm.org

A survey on malware detection with graph representation learning

T Bilot, N El Madhoun, K Al Agha, A Zouaoui - ACM Computing Surveys, 2024 - dl.acm.org

Malware detection has become a major concern due to the increasing number and
complexity of malware. Traditional detection methods based on signatures and heuristics …

被引用次数：16 相关文章所有 14 个版本

[PDF] github.io

[PDF][PDF] Masterkey: Automated jailbreaking of large language model chatbots

G Deng, Y Liu, Y Li, K Wang, Y Zhang, Z Li… - Proc. ISOC …, 2024 - tianweiz07.github.io

Large language models (LLMs), such as chatbots, have made significant strides in various
fields but remain vulnerable to jailbreak attacks, which aim to elicit inappropriate responses …

被引用次数：100 相关文章所有 2 个版本

[HTML] sciencedirect.com

[HTML][HTML] Adversarial machine learning in industry: A systematic literature review

FV Jedrzejewski, L Thode, J Fischbach, T Gorschek… - Computers & …, 2024 - Elsevier

Abstract Adversarial Machine Learning (AML) discusses the act of attacking and defending
Machine Learning (ML) Models, an essential building block of Artificial Intelligence (AI). ML …

被引用次数：5 相关文章

[PDF] usenix.org

Towards more practical threat models in artificial intelligence security

K Grosse, L Bieringer, TR Besold… - 33rd USENIX Security …, 2024 - usenix.org

Recent works have identified a gap between research and practice in artificial intelligence
security: threats studied in academia do not always reflect the practical use and security …

被引用次数：9 相关文章所有 4 个版本

高级搜索

QQ 群