相关文章- 学术资源搜索

Using natural language for reward shaping in reinforcement learning

P Goyal, S Niekum, RJ Mooney - arXiv preprint arXiv:1903.02020, 2019 - arxiv.org

Recent reinforcement learning (RL) approaches have shown strong performance in complex
domains such as Atari games, but are often highly sample inefficient. A common approach to …

被引用次数：192 相关文章所有 10 个版本

[PDF] neurips.cc

Reward learning from human preferences and demonstrations in atari

B Ibarz, J Leike, T Pohlen, G Irving… - Advances in neural …, 2018 - proceedings.neurips.cc

To solve complex real-world problems with reinforcement learning, we cannot rely on
manually specified reward functions. Instead, we need humans to communicate an objective …

被引用次数：449 相关文章所有 7 个版本

[PDF] arxiv.org

Incentivizing exploration in reinforcement learning with deep predictive models

BC Stadie, S Levine, P Abbeel - arXiv preprint arXiv:1507.00814, 2015 - arxiv.org

Achieving efficient and scalable exploration in complex domains poses a major challenge in
reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration …

被引用次数：568 相关文章所有 3 个版本

[PDF] arxiv.org

Reinforcement learning with unsupervised auxiliary tasks

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv …, 2016 - arxiv.org

Deep reinforcement learning agents have achieved state-of-the-art results by directly
maximising cumulative reward. However, environments contain a much wider variety of …

被引用次数：1466 相关文章所有 7 个版本

[PDF] neurips.cc

Accelerating reinforcement learning through gpu atari emulation

S Dalton - Advances in Neural Information Processing …, 2020 - proceedings.neurips.cc

Abstract We introduce CuLE (CUDA Learning Environment), a CUDA port of the Atari
Learning Environment (ALE) which is used for the development of deep reinforcement …

被引用次数：40 相关文章所有 6 个版本

[PDF] mlr.press

Agent57: Outperforming the atari human benchmark

AP Badia, B Piot, S Kapturowski… - International …, 2020 - proceedings.mlr.press

Atari games have been a long-standing benchmark in the reinforcement learning (RL)
community for the past decade. This benchmark was proposed to test general competency …

被引用次数：703 相关文章所有 5 个版本

[PDF] openreview.net

Investigating multi-task pretraining and generalization in reinforcement learning

AA Taiga, R Agarwal, J Farebrother… - The Eleventh …, 2023 - openreview.net

Deep reinforcement learning~(RL) has achieved remarkable successes in complex single-
task settings. However, designing RL agents that can learn multiple tasks and leverage prior …

被引用次数：34 相关文章所有 2 个版本

[PDF] arxiv.org

Return-based contrastive representation learning for reinforcement learning

G Liu, C Zhang, L Zhao, T Qin, J Zhu, J Li, N Yu… - arXiv preprint arXiv …, 2021 - arxiv.org

Recently, various auxiliary tasks have been proposed to accelerate representation learning
and improve sample efficiency in deep reinforcement learning (RL). However, existing …

被引用次数：54 相关文章所有 8 个版本

[PDF] arxiv.org

Beating atari with natural language guided reinforcement learning

R Kaplan, C Sauer, A Sosa - arXiv preprint arXiv:1704.05539, 2017 - arxiv.org

We introduce the first deep reinforcement learning agent that learns to beat Atari games with
the aid of natural language instructions. The agent uses a multimodal embedding between …

被引用次数：75 相关文章所有 3 个版本

[PDF] arxiv.org

A survey of reinforcement learning informed by natural language

J Luketina, N Nardelli, G Farquhar, J Foerster… - arXiv preprint arXiv …, 2019 - arxiv.org

To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the
compositional, relational, and hierarchical structure of the world, and learn to transfer it to the …

被引用次数：325 相关文章所有 10 个版本

高级搜索

QQ 群

Using natural language for reward shaping in reinforcement learning

Reward learning from human preferences and demonstrations in atari

Incentivizing exploration in reinforcement learning with deep predictive models

Reinforcement learning with unsupervised auxiliary tasks

Accelerating reinforcement learning through gpu atari emulation

Agent57: Outperforming the atari human benchmark

Investigating multi-task pretraining and generalization in reinforcement learning

Return-based contrastive representation learning for reinforcement learning

Beating atari with natural language guided reinforcement learning

A survey of reinforcement learning informed by natural language

相关搜索

引用