Report from the nsf future directions workshop on automatic evaluation of dialog: Research directions and challenges

S Mehri, J Choi, LF D'Haro, J Deriu, M Eskenazi… - arXiv preprint arXiv …, 2022 - arxiv.org
This is a report on the NSF Future Directions Workshop on Automatic Evaluation of Dialog.
The workshop explored the current state of the art along with its limitations and suggested …

Creative writing with an ai-powered writing assistant: Perspectives from professional writers

D Ippolito, A Yuan, A Coenen, S Burnam - arXiv preprint arXiv:2211.05030, 2022 - arxiv.org
Recent developments in natural language generation (NLG) using neural language models
have brought us closer than ever to the goal of building AI-powered creative writing tools …

A comprehensive assessment of dialog evaluation metrics

YT Yeh, M Eskenazi, S Mehri - arXiv preprint arXiv:2106.03706, 2021 - arxiv.org
Automatic evaluation metrics are a crucial component of dialog systems research. Standard
language evaluation metrics are known to be ineffective for evaluating dialog. As such …

Leveraging large language models for automated dialogue analysis

SE Finch, ES Paek, JD Choi - arXiv preprint arXiv:2309.06490, 2023 - arxiv.org
Developing high-performing dialogue systems benefits from the automatic identification of
undesirable behaviors in system responses. However, detecting such behaviors remains …

CDConv: A benchmark for contradiction detection in Chinese conversations

C Zheng, J Zhou, Y Zheng, L Peng, Z Guo… - arXiv preprint arXiv …, 2022 - arxiv.org
Dialogue contradiction is a critical issue in open-domain dialogue systems. The
contextualization nature of conversations makes dialogue contradiction detection rather …

Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems

SE Finch, JD Finch, JD Choi - arXiv preprint arXiv:2212.09180, 2022 - arxiv.org
Despite tremendous advancements in dialogue systems, stable evaluation still requires
human judgments producing notoriously high-variance metrics due to their inherent …

Mitigating contradictions in dialogue based on contrastive learning

W Li, J Kong, B Liao, Y Cai - Findings of the Association for …, 2022 - aclanthology.org
Chatbot models have achieved remarkable progress in recent years but tend to yield
contradictory responses. In this paper, we exploit the advantage of contrastive learning …

A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems

S Sato, R Akama, J Suzuki, K Inui - arXiv preprint arXiv:2403.12500, 2024 - arxiv.org
Mitigating the generation of contradictory responses poses a substantial challenge in
dialogue response generation. The quality and quantity of available contradictory response …

N-best response-based analysis of contradiction-awareness in neural response generation models

S Sato, R Akama, H Ouchi, R Tokuhisa… - arXiv preprint arXiv …, 2022 - arxiv.org
Avoiding the generation of responses that contradict the preceding context is a significant
challenge in dialogue response generation. One feasible method is post-processing, such …

I already said that! Degenerating redundant questions in open-domain dialogue systems.

L Mai, J Carson-Berndsen - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Neural text generation models have achieved remarkable success in carrying on short open-
domain conversations. However, their performance degrades significantly in the long term …