Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook

B Lin - Expert Systems with Applications, 2024 - Elsevier
In recent years, reinforcement learning and bandits have transformed a wide range of real-
world applications including healthcare, finance, recommendation systems, robotics, and …

Generating dual sequence inferences using a neural network model

V Zhong, C Xiong, R Socher - US Patent 11,170,287, 2021 - Google Patents
A computer-implemented method for dual sequence inference using a neural network model
includes generating a codependent representation based on a first input representation of a …

Dcn+: Mixed objective and deep residual coattention for question answering

C Xiong, V Zhong, R Socher - arXiv preprint arXiv:1711.00106, 2017 - arxiv.org
Traditional models for question answering optimize using cross entropy loss, which
encourages exact answers at the cost of penalizing nearby or overlapping answers that are …

[PDF][PDF] Hierarchical reinforcement learning: a survey

M Al-Emran - International journal of computing and digital systems, 2015 - academia.edu
Reinforcement Learning (RL) has been an interesting research area in Machine Learning
and AI. Hierarchical Reinforcement Learning (HRL) that decomposes the RL problem into …

Compositional generalization by learning analytical expressions

Q Liu, S An, JG Lou, B Chen, Z Lin… - Advances in …, 2020 - proceedings.neurips.cc
Compositional generalization is a basic and essential intellective capability of human
beings, which allows us to recombine known parts readily. However, existing neural network …

Natural language generation as incremental planning under uncertainty: Adaptive information presentation for statistical dialogue systems

V Rieser, O Lemon, S Keizer - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
We present and evaluate a novel approach to natural language generation (NLG) in
statistical spoken dialogue systems (SDS) using a data-driven statistical optimization …

[PDF][PDF] Optimising incremental dialogue decisions using information density for interactive systems

N Dethlefs, H Hastie, V Rieser… - Proceedings of the 2012 …, 2012 - aclanthology.org
Incremental processing allows system designers to address several discourse phenomena
that have previously been somewhat neglected in interactive systems, such as …

Hierarchical reinforcement learning for situated natural language generation

N Dethlefs, H Cuayáhuitl - Natural Language Engineering, 2015 - cambridge.org
Natural Language Generation systems in interactive settings often face a multitude of
choices, given that the communicative effect of each utterance they generate depends …

[PDF][PDF] Conditional random fields for responsive surface realisation using global features

N Dethlefs, H Hastie, H Cuayáhuitl… - Proceedings of the 51st …, 2013 - aclanthology.org
Surface realisers in spoken dialogue systems need to be more responsive than
conventional surface realisers. They need to be sensitive to the utterance context as well as …