Revisiting the arcade learning environment: Evaluation protocols and open problems for general...

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：320 相关文章所有 9 个版本

[PDF] aaai.org

A review of uncertainty for deep reinforcement learning

O Lockwood, M Si - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

Uncertainty is ubiquitous in games, both in the agents playing games and often in the games
themselves. Working with uncertainty is therefore an important component of successful …

被引用次数：57 相关文章所有 9 个版本

[PDF] arxiv.org

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

被引用次数：499 相关文章所有 2 个版本

[PDF] neurips.cc

Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

被引用次数：713 相关文章所有 8 个版本

[PDF] jair.org Full View

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

被引用次数：394 相关文章所有 9 个版本

[PDF] mlr.press

Bigger, better, faster: Human-level atari with human-level efficiency

M Schwarzer, JSO Ceron, A Courville… - International …, 2023 - proceedings.mlr.press

We introduce a value-based RL agent, which we call BBF, that achieves super-human
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …

被引用次数：78 相关文章所有 8 个版本

[PDF] arxiv.org

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arXiv preprint arXiv:2010.02193, 2020 - arxiv.org

Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

被引用次数：904 相关文章所有 7 个版本

[PDF] ieee.org

Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

被引用次数：2430 相关文章所有 10 个版本

[PDF] mlr.press

Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

T Yu, D Quillen, Z He, R Julian… - … on robot learning, 2020 - proceedings.mlr.press

Meta-reinforcement learning algorithms can enable robots to acquire new skills much more
quickly, by leveraging prior experience to learn how to learn. However, much of the current …

被引用次数：1173 相关文章所有 7 个版本

[PDF] arxiv.org

Mastering atari, go, chess and shogi by planning with a learned model

J Schrittwieser, I Antonoglou, T Hubert, K Simonyan… - Nature, 2020 - nature.com

Constructing agents with planning capabilities has long been one of the main challenges in
the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge …

被引用次数：2652 相关文章所有 15 个版本

高级搜索

QQ 群