ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

A Salemi, J Killingback, H Zamani - arXiv preprint arXiv:2501.14956, 2025 - arxiv.org
Evaluating personalized text generated by large language models (LLMs) is challenging, as
only the LLM user, i.e., the prompt author, can reliably assess the output, but re-engaging the …