Adversarial attacks and defenses for large language models (LLMs): methods, frameworks & challenges

P Kumar - International Journal of Multimedia Information …, 2024 - Springer
Large language models (LLMs) have exhibited remarkable efficacy and proficiency in a
wide array of NLP endeavors. Nevertheless, concerns are growing rapidly regarding the …

Evaluating the cybersecurity robustness of commercial llms against adversarial prompts: A promptbench analysis

T Goto, K Ono, A Morita - Authorea Preprints, 2024 - techrxiv.org
This study presents a comprehensive evaluation of the cybersecurity robustness of five
leading Large Language Models (LLMs)-ChatGPT-4, Google Gemini, Anthropic Claude …

Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

KC Fraser, H Dawkins, S Kiritchenko - arXiv preprint arXiv:2406.15583, 2024 - arxiv.org
Large language models (LLMs) have advanced to a point that even humans have difficulty
discerning whether a text was generated by another human, or by a computer. However …

AISPACE at SemEval-2024 task 8: A Class-balanced Soft-voting System for Detecting Multi-generator Machine-generated Text

R Gu, X Meng - arXiv preprint arXiv:2404.00950, 2024 - arxiv.org
SemEval-2024 Task 8 provides a challenge to detect human-written and machine-
generated text. There are 3 subtasks for different detection scenarios. This paper proposes a …

Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification

L Peng, Y Gu, C Dong, Z Wang, J Shang - arXiv preprint arXiv:2406.11115, 2024 - arxiv.org
For extremely weak-supervised text classification, pioneer research generates pseudo
labels by mining texts similar to the class names from the raw corpus, which may end up with …

Detecting Machine-Generated Texts: Not Just" AI vs Humans" and Explainability is Complicated

J Ji, R Li, S Li, J Guo, W Qiu, Z Huang, C Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
As LLMs rapidly advance, increasing concerns arise regarding risks about actual authorship
of texts we see online and in real world. The task of distinguishing LLM-authored texts is …

Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors

Y Zhou, B He, L Sun - arXiv preprint arXiv:2406.08922, 2024 - arxiv.org
With the launch of ChatGPT, large language models (LLMs) have attracted global attention.
In the realm of article writing, LLMs have witnessed extensive utilization, giving rise to …

Cross-cultural Inspiration Detection and Analysis in Real and LLM-generated Social Media Data

O Ignat, GG Lakshmy, R Mihalcea - arXiv preprint arXiv:2404.12933, 2024 - arxiv.org
Inspiration is linked to various positive outcomes, such as increased creativity, productivity,
and happiness. Although inspiration has great potential, there has been limited effort toward …

[PDF][PDF] Detecting Scams Using Large Language Models

J LIMING - arXiv preprint arXiv:2402.03147, 2024 - researchgate.net
Key characteristics of large language models include:(1) Large language models are
characterized by their immense size, often having hundreds of millions to billions of …

Class-Conditional self-reward mechanism for improved Text-to-Image models

SE Ghazouali, A Gucciardi, U Michelucci - arXiv preprint arXiv:2405.13473, 2024 - arxiv.org
Self-rewarding have emerged recently as a powerful tool in the field of Natural Language
Processing (NLP), allowing language models to generate high-quality relevant responses …