A survey of inverse reinforcement learning

S Adams, T Cody, PA Beling - Artificial Intelligence Review, 2022 - Springer
Learning from demonstration, or imitation learning, is the process of learning to act in an
environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a …

Hybrid fuzzy AHP–TOPSIS approach to prioritizing solutions for inverse reinforcement learning

V Kukreja - Complex & Intelligent Systems, 2023 - Springer
Reinforcement learning (RL) techniques nurture building up solutions for sequential
decision-making problems under uncertainty and ambiguity. RL has agents with a reward …

Dialogue POMDP components (Part II): learning the reward function

H Chinaei, B Chaib-draa - International Journal of Speech Technology, 2014 - Springer
The partially observable Markov decision process (POMDP) framework has been applied in
dialogue systems as a formal framework to represent uncertainty explicitly while being …

Requirements-aware models to support better informed decision-making for self-adaptation using partially observable markov decision processes

LH Garcia Paucar - 2020 - publications.aston.ac.uk
A self-adaptive system (SAS) is a system that can adapt its behaviour in re-sponse to
environmental fluctuations at runtime and its own changes. Therefore, the decision-making …

[图书][B] Building Dialogue POMDPs from Expert Dialogues: An End-to-end Approach

H Chinaei, B Chaib-draa - 2016 - books.google.com
This book discusses the Partially Observable Markov Decision Process (POMDP) framework
applied in dialogue systems. It presents POMDP as a formal framework to represent …

Learning the Dialog POMDP Model Components

H Chinaei, B Chaib-draa, H Chinaei… - … Dialogue POMDPs from …, 2016 - Springer
Learning the Dialog POMDP Model Components | SpringerLink Skip to main content
Advertisement SpringerLink Account Menu Find a journal Publish with us Track your research …

Five-Factor Model as a Predictor for Spoken Dialog Systems

T Carter - 2016 - search.proquest.com
Human behavior varies widely as does the design of spoken dialog systems (SDS). The
search for predictors to match a user's preference and efficiency for a specific dialog …