Privacy in large language models: Attacks, defenses and future directions

H Li, Y Chen, J Luo, J Wang, H Peng, Y Kang… - arXiv preprint arXiv …, 2023 - arxiv.org
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …

Longwriter: Unleashing 10,000+ word generation from long context llms

Y Bai, J Zhang, X Lv, L Zheng, S Zhu, L Hou… - arXiv preprint arXiv …, 2024 - arxiv.org
Current long context large language models (LLMs) can process inputs up to 100,000
tokens, yet struggle to generate outputs exceeding even a modest length of 2,000 words …

Eia: Environmental injection attack on generalist web agents for privacy leakage

Z Liao, L Mo, C Xu, M Kang, J Zhang, C Xiao… - arXiv preprint arXiv …, 2024 - arxiv.org
Generalist web agents have demonstrated remarkable potential in autonomously
completing a wide range of tasks on real websites, significantly boosting human productivity …

Jailbreaking llm-controlled robots

A Robey, Z Ravichandran, V Kumar, H Hassani… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent introduction of large language models (LLMs) has revolutionized the field of
robotics by enabling contextual reasoning and intuitive human-robot interaction in domains …

Defending jailbreak prompts via in-context adversarial game

Y Zhou, Y Han, H Zhuang, K Guo, Z Liang… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) demonstrate remarkable capabilities across diverse
applications. However, concerns regarding their security, particularly the vulnerability to …

Aligning llms to be robust against prompt injection

S Chen, A Zharmagambetov, S Mahloujifar… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) are becoming increasingly prevalent in modern software
systems, interfacing between the user and the internet to assist with tasks that require …

Evaluating the instruction-following robustness of large language models to prompt injection

Z Li, B Peng, P He, X Yan - … of the 2024 Conference on Empirical …, 2024 - aclanthology.org
Abstract Large Language Models (LLMs) have demonstrated exceptional proficiency in
instruction-following, making them increasingly integral to various applications. However …

Granular privacy control for geolocation with vision language models

E Mendes, Y Chen, J Hays, S Das, W Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision Language Models (VLMs) are rapidly advancing in their capability to answer
information-seeking questions. As these models are widely deployed in consumer …

Sfr-rag: Towards contextually faithful llms

XP Nguyen, S Pandit, S Purushwalkam, A Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Retrieval Augmented Generation (RAG), a paradigm that integrates external contextual
information with large language models (LLMs) to enhance factual accuracy and relevance …

On memorization of large language models in logical reasoning

C Xie, Y Huang, C Zhang, D Yu, X Chen, BY Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) achieve good performance on challenging reasoning
benchmarks, yet could also make basic reasoning mistakes. This contrasting behavior is …