Pomdp-based statistical spoken dialog systems: A review

S Young, M Gašić, B Thomson… - Proceedings of the …, 2013 - ieeexplore.ieee.org
Statistical dialog systems (SDSs) are motivated by the need for a data-driven framework that
reduces the cost of laboriously handcrafting complex dialog managers and that provides …

A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning

WC Kwan, HR Wang, HM Wang, KF Wong - Machine Intelligence …, 2023 - Springer
Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD)
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …

A survey of inverse reinforcement learning

S Adams, T Cody, PA Beling - Artificial Intelligence Review, 2022 - Springer
Learning from demonstration, or imitation learning, is the process of learning to act in an
environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a …

Guided dialog policy learning: Reward estimation for multi-domain task-oriented dialog

R Takanobu, H Zhu, M Huang - arXiv preprint arXiv:1908.10719, 2019 - arxiv.org
Dialog policy decides what and how a task-oriented dialog system will respond, and plays a
vital role in delivering effective conversations. Many studies apply Reinforcement Learning …

Learning to control listening-oriented dialogue using partially observable markov decision processes

T Meguro, Y Minami, R Higashinaka… - ACM Transactions on …, 2014 - dl.acm.org
Our aim is to build listening agents that attentively listen to their users and satisfy their desire
to speak and have themselves heard. This article investigates how to automatically create a …

Methods for robot behavior adaptation for cognitive neurorehabilitation

A Kubota, LD Riek - Annual review of control, robotics, and …, 2022 - annualreviews.org
An estimated 11% of adults report experiencing some form of cognitive decline, which may
be associated with conditions such as stroke or dementia and can impact their memory …

Score-based inverse reinforcement learning

L El Asri, B Piot, M Geist, R Laroche… - … on Autonomous Agents …, 2016 - inria.hal.science
This paper reports theoretical and empirical results obtained for the score-based Inverse
Reinforcement Learning (IRL) algorithm. It relies on a non-standard setting for IRL consisting …

Structured probabilistic modelling for dialogue management

P Lison - 2014 - duo.uio.no
This thesis presents a new modelling framework for dialogue management based on the
concept of probabilistic rules. Probabilistic rules are defined as if... then... else constructions …

A comprehensive reinforcement learning framework for dialogue management optimization

L Daubigney, M Geist… - IEEE Journal of …, 2012 - ieeexplore.ieee.org
Reinforcement learning is now an acknowledged approach for optimizing the interaction
strategy of spoken dialogue systems. If the first considered algorithms were quite basic (like …

[PDF][PDF] Preference-learning based Inverse Reinforcement Learning for Dialog Control.

H Sugiyama, T Meguro, Y Minami - INTERSPEECH, 2012 - isca-archive.org
Dialog systems that realize dialog control with reinforcement learning have recently been
proposed. However, reinforcement learning has an open problem that it requires a reward …