- 学术资源搜索

Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer

In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

被引用次数：108 相关文章所有 12 个版本

[PDF] springer.com

A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning

WC Kwan, HR Wang, HM Wang, KF Wong - Machine Intelligence …, 2023 - Springer

Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD)
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …

被引用次数：25 相关文章所有 6 个版本

[PDF] arxiv.org

Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges

Z Zhou, G Liu, Y Tang - arXiv preprint arXiv:2305.10091, 2023 - arxiv.org

Multi-agent reinforcement learning (MARL) is a widely used Artificial Intelligence (AI)
technique. However, current studies and applications need to address its scalability, non …

被引用次数：14 相关文章所有 2 个版本

[PDF] aaai.org

Efficient dialog policy learning by reasoning with contextual knowledge

H Zhang, Z Zeng, K Lu, K Wu, S Zhang - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting
language actions based on the current dialog state. Deep reinforcement learning methods …

被引用次数：10 相关文章所有 5 个版本

[PDF] plos.org

Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning

T Saha, S Saha, P Bhattacharyya - PloS one, 2020 - journals.plos.org

Purpose Developing a Dialogue/Virtual Agent (VA) that can handle complex tasks (need) of
the user pertaining to multiple intents of a domain is challenging as it requires the agent to …

被引用次数：21 相关文章所有 12 个版本

[PDF] arxiv.org

Learning dialog policies from weak demonstrations

G Gordon-Hall, PJ Gorinski, SB Cohen - arXiv preprint arXiv:2004.11054, 2020 - arxiv.org

Deep reinforcement learning is a promising approach to training a dialog manager, but
current methods struggle with the large state and action spaces of multi-domain dialog …

被引用次数：24 相关文章所有 6 个版本

[PDF] plos.org

A dynamic goal adapted task oriented dialogue agent

A Tiwari, T Saha, S Saha, S Sengupta, A Maitra… - Plos one, 2021 - journals.plos.org

Purpose Existing virtual agents (VAs) present in dialogue systems are either information
retrieval based or static goal-driven. However, in real-world situations, end-users might not …

被引用次数：16 相关文章所有 10 个版本

[PDF] aclanthology.org

Efficient dialogue complementary policy learning via deep q-network policy and episodic memory policy

Y Zhao, Z Wang, C Zhu, S Wang - Proceedings of the 2021 …, 2021 - aclanthology.org

Deep reinforcement learning has shown great potential in training dialogue policies.
However, its favorable performance comes at the cost of many rounds of interaction. Most of …

被引用次数：10 相关文章所有 4 个版本

[PDF] arxiv.org

Augmenting knowledge through statistical, goal-oriented human-robot dialog

S Amiri, S Bajracharya, C Goktolgal… - 2019 IEEE/RSJ …, 2019 - ieeexplore.ieee.org

Some robots can interact with humans using natural language, and identify service requests
through human-robot dialog. However, few robots are able to improve their language …

被引用次数：24 相关文章所有 5 个版本

[PDF] aaai.org

Dynamic reward-based dueling deep dyna-q: Robust policy learning in noisy environments

Y Zhao, Z Wang, K Yin, R Zhang, Z Huang… - Proceedings of the AAAI …, 2020 - aaai.org

Task-oriented dialogue systems provide a convenient interface to help users complete tasks.
An important consideration for task-oriented dialogue systems is the ability to against the …

被引用次数：21 相关文章所有 5 个版本

高级搜索

QQ 群

Survey on reinforcement learning for language processing

A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning

Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges

Efficient dialog policy learning by reasoning with contextual knowledge

Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning

Learning dialog policies from weak demonstrations

A dynamic goal adapted task oriented dialogue agent

Efficient dialogue complementary policy learning via deep q-network policy and episodic memory policy

Augmenting knowledge through statistical, goal-oriented human-robot dialog

Dynamic reward-based dueling deep dyna-q: Robust policy learning in noisy environments

引用