Dialogue POMDP components (part I): learning states and observations

S Adams, T Cody, PA Beling - Artificial Intelligence Review, 2022 - Springer

Learning from demonstration, or imitation learning, is the process of learning to act in an
environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a …

被引用次数：84 相关文章所有 8 个版本

[PDF] springer.com

Hybrid fuzzy AHP–TOPSIS approach to prioritizing solutions for inverse reinforcement learning

V Kukreja - Complex & Intelligent Systems, 2023 - Springer

Reinforcement learning (RL) techniques nurture building up solutions for sequential
decision-making problems under uncertainty and ambiguity. RL has agents with a reward …

被引用次数：51 相关文章所有 4 个版本

[PDF] researchgate.net

Dialogue POMDP components (Part II): learning the reward function

H Chinaei, B Chaib-draa - International Journal of Speech Technology, 2014 - Springer

The partially observable Markov decision process (POMDP) framework has been applied in
dialogue systems as a formal framework to represent uncertainty explicitly while being …

被引用次数：7 相关文章所有 5 个版本

[PDF] aston.ac.uk

Requirements-aware models to support better informed decision-making for self-adaptation using partially observable markov decision processes

LH Garcia Paucar - 2020 - publications.aston.ac.uk

A self-adaptive system (SAS) is a system that can adapt its behaviour in re-sponse to
environmental fluctuations at runtime and its own changes. Therefore, the decision-making …

被引用次数：2 相关文章所有 6 个版本

[图书][B] Building Dialogue POMDPs from Expert Dialogues: An End-to-end Approach

H Chinaei, B Chaib-draa - 2016 - books.google.com

This book discusses the Partially Observable Markov Decision Process (POMDP) framework
applied in dialogue systems. It presents POMDP as a formal framework to represent …

被引用次数：3 相关文章所有 4 个版本

Learning the Dialog POMDP Model Components

H Chinaei, B Chaib-draa, H Chinaei… - … Dialogue POMDPs from …, 2016 - Springer

Learning the Dialog POMDP Model Components | SpringerLink Skip to main content
Advertisement SpringerLink Account Menu Find a journal Publish with us Track your research …

[PDF] nova.edu

Five-Factor Model as a Predictor for Spoken Dialog Systems

T Carter - 2016 - search.proquest.com

Human behavior varies widely as does the design of spoken dialog systems (SDS). The
search for predictors to match a user's preference and efficiency for a specific dialog …

高级搜索

QQ 群