Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form...

文章

学术资源搜索

获得 1 条结果（用时0.02秒）

我的图书馆

Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form...

在引用文章中搜索

[PDF] arxiv.org

ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

A Salemi, J Killingback, H Zamani - arXiv preprint arXiv:2501.14956, 2025 - arxiv.org

Evaluating personalized text generated by large language models (LLMs) is challenging, as
only the LLM user, ie, prompt author, can reliably assess the output, but re-engaging the …

高级搜索

QQ 群

Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form...

ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

引用