Siren's song in the AI ocean: a survey on hallucination in large language models

Y Zhang, Y Li, L Cui, D Cai, L Liu, T Fu… - arXiv preprint arXiv …, 2023 - arxiv.org
While large language models (LLMs) have demonstrated remarkable capabilities across a
range of downstream tasks, a significant concern revolves around their propensity to exhibit …

Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

Leak, cheat, repeat: Data contamination and evaluation malpractices in closed-source llms

S Balloccu, P Schmidtová, M Lango… - arXiv preprint arXiv …, 2024 - arxiv.org
Natural Language Processing (NLP) research is increasingly focusing on the use of Large
Language Models (LLMs), with some of the most popular ones being either fully or partially …

Does mapo tofu contain coffee? probing llms for food-related cultural knowledge

L Zhou, T Karidi, W Liu, N Garneau, Y Cao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent studies have highlighted the presence of cultural biases in Large Language Models
(LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively …

Acquiring and modeling abstract commonsense knowledge via conceptualization

M He, T Fang, W Wang, Y Song - Artificial Intelligence, 2024 - Elsevier
Conceptualization, or viewing entities and situations as instances of abstract concepts in
mind and making inferences based on that, is a vital component in human intelligence for …

Vullibgen: Identifying vulnerable third-party libraries via generative pre-trained model

T Chen, L Li, L Zhu, Z Li, G Liang, D Li, Q Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
To avoid potential risks posed by vulnerabilities in third-party libraries, security researchers
maintain vulnerability databases (eg, NVD) containing vulnerability reports, each of which …

[HTML][HTML] Assessing how accurately large language models encode and apply the common European framework of reference for languages

L Benedetto, G Gaudeau, A Caines, P Buttery - Computers and Education …, 2025 - Elsevier
Abstract Large Language Models (LLMs) can have a transformative effect on a variety of
domains, including education, and it is therefore pressing to understand whether these …

Editing conceptual knowledge for large language models

X Wang, S Mao, N Zhang, S Deng, Y Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, there has been a growing interest in knowledge editing for Large Language
Models (LLMs). Current approaches and evaluations merely explore the instance-level …

Conversational Interactions with NPCs in LLM-Driven Gaming: Guidelines from a Content Analysis of Player Feedback

SR Cox, WT Ooi - International Workshop on Chatbot Research and …, 2023 - Springer
The growing capability and availability of large language models (LLMs) have led to their
adoption in a number of domains. One application domain that could prove fruitful is to video …

AIGC 大模型测评综述: 使能技术, 安全隐患和应对.

许志伟, 李海龙, 李博, 李涛, 王嘉泰… - Journal of Frontiers …, 2024 - search.ebscohost.com
人工智能生成内容(AIGC) 模型因出色的内容生成能力, 在全球范围内引起了广泛关注与应用.
然而AIGC 大模型的快速发展也带来了一系列隐患, 例如模型生成结果的可解释性 …