Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning

WC Kwan, HR Wang, HM Wang, KF Wong - Machine Intelligence …, 2023 - Springer
Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD)
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …

Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges

Z Zhou, G Liu, Y Tang - arXiv preprint arXiv:2305.10091, 2023 - arxiv.org
Multi-agent reinforcement learning (MARL) is a widely used Artificial Intelligence (AI)
technique. However, current studies and applications need to address its scalability, non …

Efficient dialog policy learning by reasoning with contextual knowledge

H Zhang, Z Zeng, K Lu, K Wu, S Zhang - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting
language actions based on the current dialog state. Deep reinforcement learning methods …

Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning

T Saha, S Saha, P Bhattacharyya - PloS one, 2020 - journals.plos.org
Purpose Developing a Dialogue/Virtual Agent (VA) that can handle complex tasks (need) of
the user pertaining to multiple intents of a domain is challenging as it requires the agent to …

Learning dialog policies from weak demonstrations

G Gordon-Hall, PJ Gorinski, SB Cohen - arXiv preprint arXiv:2004.11054, 2020 - arxiv.org
Deep reinforcement learning is a promising approach to training a dialog manager, but
current methods struggle with the large state and action spaces of multi-domain dialog …

A dynamic goal adapted task oriented dialogue agent

A Tiwari, T Saha, S Saha, S Sengupta, A Maitra… - Plos one, 2021 - journals.plos.org
Purpose Existing virtual agents (VAs) present in dialogue systems are either information
retrieval based or static goal-driven. However, in real-world situations, end-users might not …

Efficient dialogue complementary policy learning via deep q-network policy and episodic memory policy

Y Zhao, Z Wang, C Zhu, S Wang - Proceedings of the 2021 …, 2021 - aclanthology.org
Deep reinforcement learning has shown great potential in training dialogue policies.
However, its favorable performance comes at the cost of many rounds of interaction. Most of …

Augmenting knowledge through statistical, goal-oriented human-robot dialog

S Amiri, S Bajracharya, C Goktolgal… - 2019 IEEE/RSJ …, 2019 - ieeexplore.ieee.org
Some robots can interact with humans using natural language, and identify service requests
through human-robot dialog. However, few robots are able to improve their language …

Dynamic reward-based dueling deep dyna-q: Robust policy learning in noisy environments

Y Zhao, Z Wang, K Yin, R Zhang, Z Huang… - Proceedings of the AAAI …, 2020 - aaai.org
Task-oriented dialogue systems provide a convenient interface to help users complete tasks.
An important consideration for task-oriented dialogue systems is the ability to against the …