A gentle introduction to reinforcement learning and its application in different fields

M Naeem, STH Rizvi, A Coronato - IEEE access, 2020 - ieeexplore.ieee.org
Due to the recent progress in Deep Neural Networks, Reinforcement Learning (RL) has
become one of the most important and useful technology. It is a learning method where a …

A survey of reinforcement learning informed by natural language

J Luketina, N Nardelli, G Farquhar, J Foerster… - arXiv preprint arXiv …, 2019 - arxiv.org
To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the
compositional, relational, and hierarchical structure of the world, and learn to transfer it to the …

Learning to model the world with language

J Lin, Y Du, O Watkins, D Hafner, P Abbeel… - arXiv preprint arXiv …, 2023 - arxiv.org
To interact with humans in the world, agents need to understand the diverse types of
language that people use, relate them to the visual world, and act based on them. While …

Language understanding for text-based games using deep reinforcement learning

K Narasimhan, T Kulkarni, R Barzilay - arXiv preprint arXiv:1506.08941, 2015 - arxiv.org
In this paper, we consider the task of learning control policies for text-based games. In these
games, all interactions in the virtual world are through text and the underlying state is not …

[PDF][PDF] Learning to solve arithmetic word problems with verb categorization

MJ Hosseini, H Hajishirzi, O Etzioni… - Proceedings of the …, 2014 - aclanthology.org
This paper presents a novel approach to learning to solve simple arithmetic word problems.
Our system, ARIS, analyzes each of the sentences in the problem statement to identify the …

Learning dependency-based compositional semantics

P Liang, MI Jordan, D Klein - Computational Linguistics, 2013 - direct.mit.edu
Suppose we want to build a system that answers a natural language question by
representing its semantics as a logical forxm and computing the answer given a structured …

Neural programmer: Inducing latent programs with gradient descent

A Neelakantan, QV Le, I Sutskever - arXiv preprint arXiv:1511.04834, 2015 - arxiv.org
Deep neural networks have achieved impressive supervised classification performance in
many tasks including image recognition, speech recognition, and sequence to sequence …

Read and reap the rewards: Learning to play atari with the help of instruction manuals

Y Wu, Y Fan, PP Liang, A Azaria… - Advances in Neural …, 2024 - proceedings.neurips.cc
High sample complexity has long been a challenge for RL. On the other hand, humans learn
to perform tasks not only from interaction or demonstrations, but also by reading …

[PDF][PDF] Solving geometry problems: Combining text and diagram interpretation

M Seo, H Hajishirzi, A Farhadi, O Etzioni… - Proceedings of the …, 2015 - aclanthology.org
This paper introduces GEOS, the first automated system to solve unaltered SAT geometry
questions by combining text understanding and diagram interpretation. We model the …

A comprehensive survey on instruction following

R Lou, K Zhang, W Yin - arXiv preprint arXiv:2303.10475, 2023 - arxiv.org
Task semantics can be expressed by a set of input-output examples or a piece of textual
instruction. Conventional machine learning approaches for natural language processing …