Using natural language for reward shaping in reinforcement learning

P Goyal, S Niekum, RJ Mooney - arXiv preprint arXiv:1903.02020, 2019 - arxiv.org
Recent reinforcement learning (RL) approaches have shown strong performance in complex
domains such as Atari games, but are often highly sample inefficient. A common approach to …

Reward learning from human preferences and demonstrations in atari

B Ibarz, J Leike, T Pohlen, G Irving… - Advances in neural …, 2018 - proceedings.neurips.cc
To solve complex real-world problems with reinforcement learning, we cannot rely on
manually specified reward functions. Instead, we need humans to communicate an objective …

Incentivizing exploration in reinforcement learning with deep predictive models

BC Stadie, S Levine, P Abbeel - arXiv preprint arXiv:1507.00814, 2015 - arxiv.org
Achieving efficient and scalable exploration in complex domains poses a major challenge in
reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration …

Reinforcement learning with unsupervised auxiliary tasks

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv …, 2016 - arxiv.org
Deep reinforcement learning agents have achieved state-of-the-art results by directly
maximising cumulative reward. However, environments contain a much wider variety of …

Accelerating reinforcement learning through gpu atari emulation

S Dalton - Advances in Neural Information Processing …, 2020 - proceedings.neurips.cc
Abstract We introduce CuLE (CUDA Learning Environment), a CUDA port of the Atari
Learning Environment (ALE) which is used for the development of deep reinforcement …

Agent57: Outperforming the atari human benchmark

AP Badia, B Piot, S Kapturowski… - International …, 2020 - proceedings.mlr.press
Atari games have been a long-standing benchmark in the reinforcement learning (RL)
community for the past decade. This benchmark was proposed to test general competency …

Investigating multi-task pretraining and generalization in reinforcement learning

AA Taiga, R Agarwal, J Farebrother… - The Eleventh …, 2023 - openreview.net
Deep reinforcement learning~(RL) has achieved remarkable successes in complex single-
task settings. However, designing RL agents that can learn multiple tasks and leverage prior …

Return-based contrastive representation learning for reinforcement learning

G Liu, C Zhang, L Zhao, T Qin, J Zhu, J Li, N Yu… - arXiv preprint arXiv …, 2021 - arxiv.org
Recently, various auxiliary tasks have been proposed to accelerate representation learning
and improve sample efficiency in deep reinforcement learning (RL). However, existing …

Beating atari with natural language guided reinforcement learning

R Kaplan, C Sauer, A Sosa - arXiv preprint arXiv:1704.05539, 2017 - arxiv.org
We introduce the first deep reinforcement learning agent that learns to beat Atari games with
the aid of natural language instructions. The agent uses a multimodal embedding between …

A survey of reinforcement learning informed by natural language

J Luketina, N Nardelli, G Farquhar, J Foerster… - arXiv preprint arXiv …, 2019 - arxiv.org
To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the
compositional, relational, and hierarchical structure of the world, and learn to transfer it to the …