Advances and challenges in multi-domain task-oriented dialogue policy optimization

M Rohmatillah, JT Chien - APSIPA Transactions on Signal …, 2023 - nowpublishers.com
Developing a successful dialogue policy for a multi-domain task-oriented dialogue (MDTD)
system is a challenging task. Basically, a desirable dialogue policy acts as the decision …

Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning

X Yu, M Chen, Z Yu - arXiv preprint arXiv:2305.13660, 2023 - arxiv.org
Planning for goal-oriented dialogue often requires simulating future dialogue interactions
and estimating task progress. Many approaches thus consider training neural networks to …

System initiative prediction for multi-turn conversational information seeking

C Meng, M Aliannejadi, M de Rijke - Proceedings of the 32nd ACM …, 2023 - dl.acm.org
Identifying the right moment for a system to take the initiative is essential to conversational
information seeking (CIS). Existing studies have extensively studied the clarification need …

Multi-action dialog policy learning from logged user feedback

S Zhang, J Zhao, P Wang, T Wang, Z Liang… - Proceedings of the …, 2023 - ojs.aaai.org
Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn,
has been widely applied in task-oriented dialog systems to provide expressive and efficient …

Investigation of look-ahead techniques to improve response time in spoken dialogue system

M Ohagi, T Mizumoto, K Yoshikawa - Proc. Interspeech 2024, 2024 - isca-archive.org
This paper reports a new method that improves the response speed in spoken dialogue
systems that use large language models. In existing systems, the start of the chatbot's …