Biological underpinnings for lifelong learning machines

D Kudithipudi, M Aguilar-Simon, J Babb… - Nature Machine …, 2022 - nature.com
Biological organisms learn from interactions with their environment throughout their lifetime.
For artificial systems to successfully act and adapt in the real world, it is desirable to similarly …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Voyager: An open-ended embodied agent with large language models

G Wang, Y Xie, Y Jiang, A Mandlekar, C Xiao… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce Voyager, the first LLM-powered embodied lifelong learning agent in Minecraft
that continuously explores the world, acquires diverse skills, and makes novel discoveries …

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Deep problems with neural network models of human vision

JS Bowers, G Malhotra, M Dujmović… - Behavioral and Brain …, 2023 - cambridge.org
Deep neural networks (DNNs) have had extraordinary successes in classifying
photographic images of objects and are often described as the best models of biological …

Evolving curricula with regret-based environment design

J Parker-Holder, M Jiang, M Dennis… - International …, 2022 - proceedings.mlr.press
Training generally-capable agents with reinforcement learning (RL) remains a significant
challenge. A promising avenue for improving the robustness of RL agents is through the use …

Emergent complexity and zero-shot transfer via unsupervised environment design

M Dennis, N Jaques, E Vinitsky… - Advances in neural …, 2020 - proceedings.neurips.cc
A wide range of reinforcement learning (RL) problems---including robustness, transfer
learning, unsupervised RL, and emergent complexity---require specifying a distribution of …

Human-timescale adaptation in an open-ended task space

AA Team, J Bauer, K Baumli, S Baveja… - arXiv preprint arXiv …, 2023 - arxiv.org
Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …

Replay-guided adversarial environment design

M Jiang, M Dennis, J Parker-Holder… - Advances in …, 2021 - proceedings.neurips.cc
Deep reinforcement learning (RL) agents may successfully generalize to new settings if
trained on an appropriately diverse set of environment and task configurations …

Human-timescale adaptation in an open-ended task space

J Bauer, K Baumli, F Behbahani… - International …, 2023 - proceedings.mlr.press
Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …