Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output

M Li, B Peng, M Galley, J Gao, Z Zhang - arXiv preprint arXiv:2305.14623, 2023 - arxiv.org

Fact-checking is an essential task in NLP that is commonly utilized for validating the factual
accuracy of claims. Prior work has mainly focused on fine-tuning pre-trained languages …

Cited by 48 - Related articles - All 3 versions

Factuality of large language models in the year 2024

Y Wang, M Wang, MA Manzoor, F Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs), especially when instruction-tuned for chat, have become
part of our daily lives, freeing people from the process of searching, extracting, and …

Cited by 12 - Related articles - All 2 versions

VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation

Y Song, Y Kim, M Iyyer - arXiv preprint arXiv:2406.19276, 2024 - arxiv.org

Existing metrics for evaluating the factuality of long-form text, such as FACTSCORE (Min et
al., 2023) and SAFE (Wei et al., 2024), decompose an input text into "atomic claims" and …

Cited by 8 - Related articles - All 3 versions

Multi-fact: Assessing multilingual llms' multi-regional knowledge using factscore

S Shafayat, E Kim, J Oh, A Oh - arXiv preprint arXiv …, 2024 - globalaicultures.github.io

Abstract Large Language Models (LLMs) are prone to factuality hallucination, generating
text that contradicts established knowledge. While extensive research has addressed this in …

Cited by 14 - Related articles - All 2 versions

Automated justification production for claim veracity in fact checking: A survey on architectures and approaches

I Eldifrawi, S Wang, A Trabelsi - arXiv preprint arXiv:2407.12853, 2024 - arxiv.org

Automated Fact-Checking (AFC) is the automated verification of claim accuracy. AFC is
crucial in discerning truth from misinformation, especially given the huge amounts of content …

Cited by 1 - Related articles - All 4 versions

Factuality of large language models: A survey

Y Wang, M Wang, MA Manzoor, F Liu… - Proceedings of the …, 2024 - aclanthology.org

Large language models (LLMs), especially when instruction-tuned for chat, have become
part of our daily lives, freeing people from the process of searching, extracting, and …

Cited by 3 - Related articles

Benchmarks as microscopes: A call for model metrology

M Saxon, A Holtzman, P West, WY Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Modern language models (LMs) pose a new challenge in capability assessment. Static
benchmarks inevitably saturate without providing confidence in the deployment tolerances …

Cited by 4 - Related articles - All 4 versions

Molecular facts: Desiderata for decontextualization in llm fact verification

A Gunjal, G Durrett - arXiv preprint arXiv:2406.20079, 2024 - arxiv.org

Automatic factuality verification of large language model (LLM) generations is becoming
more and more widely used to combat hallucinations. A major point of tension in the …

Cited by 4 - Related articles - All 4 versions

Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph

R Vashurin, E Fadeeva, A Vazhentsev… - arXiv preprint arXiv …, 2024 - arxiv.org

Uncertainty quantification (UQ) is a critical component of machine learning (ML)
applications. The rapid proliferation of large language models (LLMs) has stimulated …

Cited by 3 - Related articles - All 2 versions

A survey of ai-generated text forensic systems: Detection, attribution, and characterization

T Kumarage, G Agrawal, P Sheth, R Moraffah… - arXiv preprint arXiv …, 2024 - arxiv.org

We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs)
capable of generating high-quality text. While these LLMs have revolutionized text …

Cited by 9 - Related articles - All 4 versions
