A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions

L Huang, W Yu, W Ma, W Zhong, Z Feng… - ACM Transactions on …, 2024 - dl.acm.org
The emergence of large language models (LLMs) has marked a significant breakthrough in
natural language processing (NLP), fueling a paradigm shift in information acquisition …

A survey of language model confidence estimation and calibration

J Geng, F Cai, Y Wang, H Koeppl, P Nakov… - arXiv preprint arXiv …, 2023 - arxiv.org
Language models (LMs) have demonstrated remarkable capabilities across a wide range of
tasks in various domains. Despite their impressive performance, the reliability of their output …

[HTML][HTML] Faithful AI in medicine: a systematic review with large language models and beyond

Q Xie, EJ Schenck, HS Yang, Y Chen, Y Peng, F Wang - MedRxiv, 2023 - ncbi.nlm.nih.gov
Artificial intelligence (AI), especially the most recent large language models (LLMs), holds
great promise in healthcare and medicine, with applications spanning from biological …

Meta-learning online adaptation of language models

N Hu, E Mitchell, CD Manning, C Finn - arXiv preprint arXiv:2305.15076, 2023 - arxiv.org
Large language models encode impressively broad world knowledge in their parameters.
However, the knowledge in static language models falls out of date, limiting the model's …

Towards fine-grained citation evaluation in generated text: A comparative analysis of faithfulness metrics

W Zhang, M Aliannejadi, Y Yuan, J Pei… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) often produce unsupported or unverifiable content, known
as" hallucinations." To mitigate this, retrieval-augmented LLMs incorporate citations …

Relic: Investigating large language model responses using self-consistency

F Cheng, V Zouhar, S Arora, M Sachan… - Proceedings of the CHI …, 2024 - dl.acm.org
Large Language Models (LLMs) are notorious for blending fact with fiction and generating
non-factual content, known as hallucinations. To address this challenge, we propose an …

Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation

Z Kasner, O Dušek - Proceedings of the 62nd Annual Meeting of …, 2024 - aclanthology.org
We analyze the behaviors of open large language models (LLMs) on the task of data-to-text
(D2T) generation, ie, generating coherent and relevant text from structured data. To avoid …

Raucg: Retrieval-augmented unsupervised counter narrative generation for hate speech

S Jiang, W Tang, X Chen, R Tanga, H Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
The Counter Narrative (CN) is a promising approach to combat online hate speech (HS)
without infringing on freedom of speech. In recent years, there has been a growing interest …

Online adaptation of language models with a memory of amortized contexts

J Tack, J Kim, E Mitchell, J Shin, YW Teh… - arXiv preprint arXiv …, 2024 - arxiv.org
Due to the rapid generation and dissemination of information, large language models
(LLMs) quickly run out of date despite enormous development costs. Due to this crucial need …

Usable XAI: 10 strategies towards exploiting explainability in the LLM era

X Wu, H Zhao, Y Zhu, Y Shi, F Yang, T Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Explainable AI (XAI) refers to techniques that provide human-understandable insights into
the workings of AI models. Recently, the focus of XAI is being extended towards Large …