Addressing inquiries about history: An efficient and practical framework for evaluating open-doma...

Report from the nsf future directions workshop on automatic evaluation of dialog: Research directions and challenges

S Mehri, J Choi, LF D'Haro, J Deriu, M Eskenazi… - arXiv preprint arXiv …, 2022 - arxiv.org

This is a report on the NSF Future Directions Workshop on Automatic Evaluation of Dialog.
The workshop explored the current state of the art along with its limitations and suggested …

被引用次数：29 相关文章所有 3 个版本

[PDF] arxiv.org

Creative writing with an ai-powered writing assistant: Perspectives from professional writers

D Ippolito, A Yuan, A Coenen, S Burnam - arXiv preprint arXiv:2211.05030, 2022 - arxiv.org

Recent developments in natural language generation (NLG) using neural language models
have brought us closer than ever to the goal of building AI-powered creative writing tools …

被引用次数：84 相关文章所有 2 个版本

[PDF] arxiv.org

A comprehensive assessment of dialog evaluation metrics

YT Yeh, M Eskenazi, S Mehri - arXiv preprint arXiv:2106.03706, 2021 - arxiv.org

Automatic evaluation metrics are a crucial component of dialog systems research. Standard
language evaluation metrics are known to be ineffective for evaluating dialog. As such …

被引用次数：98 相关文章所有 6 个版本

[PDF] arxiv.org

Leveraging large language models for automated dialogue analysis

SE Finch, ES Paek, JD Choi - arXiv preprint arXiv:2309.06490, 2023 - arxiv.org

Developing high-performing dialogue systems benefits from the automatic identification of
undesirable behaviors in system responses. However, detecting such behaviors remains …

被引用次数：10 相关文章所有 6 个版本

[PDF] arxiv.org

CDConv: A benchmark for contradiction detection in Chinese conversations

C Zheng, J Zhou, Y Zheng, L Peng, Z Guo… - arXiv preprint arXiv …, 2022 - arxiv.org

Dialogue contradiction is a critical issue in open-domain dialogue systems. The
contextualization nature of conversations makes dialogue contradiction detection rather …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems

SE Finch, JD Finch, JD Choi - arXiv preprint arXiv:2212.09180, 2022 - arxiv.org

Despite tremendous advancements in dialogue systems, stable evaluation still requires
human judgments producing notoriously high-variance metrics due to their inherent …

被引用次数：16 相关文章所有 4 个版本

[PDF] aclanthology.org

Mitigating contradictions in dialogue based on contrastive learning

W Li, J Kong, B Liao, Y Cai - Findings of the Association for …, 2022 - aclanthology.org

Chatbot models have achieved remarkable progress in recent years but tend to yield
contradictory responses. In this paper, we exploit the advantage of contrastive learning …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems

S Sato, R Akama, J Suzuki, K Inui - arXiv preprint arXiv:2403.12500, 2024 - arxiv.org

Mitigating the generation of contradictory responses poses a substantial challenge in
dialogue response generation. The quality and quantity of available contradictory response …

N-best response-based analysis of contradiction-awareness in neural response generation models

S Sato, R Akama, H Ouchi, R Tokuhisa… - arXiv preprint arXiv …, 2022 - arxiv.org

Avoiding the generation of responses that contradict the preceding context is a significant
challenge in dialogue response generation. One feasible method is post-processing, such …

被引用次数：1 相关文章所有 9 个版本

[PDF] aclanthology.org

I already said that! Degenerating redundant questions in open-domain dialogue systems.

L Mai, J Carson-Berndsen - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org

Neural text generation models have achieved remarkable success in carrying on short open-
domain conversations. However, their performance degrades significantly in the long term …

高级搜索

QQ 群