Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc
We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Transdreamer: Reinforcement learning with transformer world models

C Chen, YF Wu, J Yoon, S Ahn - arXiv preprint arXiv:2202.09481, 2022 - arxiv.org
The Dreamer agent provides various benefits of Model-Based Reinforcement Learning
(MBRL) such as sample efficiency, reusable knowledge, and safe planning. However, its …

Elements of episodic memory: insights from artificial agents

A Boyle, A Blomkvist - Philosophical Transactions B, 2024 - royalsocietypublishing.org
Many recent artificial intelligence (AI) systems take inspiration from biological episodic
memory. Here, we ask how these 'episodic-inspired'AI systems might inform our …

Large-scale retrieval for reinforcement learning

P Humphreys, A Guez, O Tieleman… - Advances in …, 2022 - proceedings.neurips.cc
Effective decision making involves flexibly relating past experiences and relevant contextual
information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm …

Robocat: A self-improving generalist agent for robotic manipulation

K Bousmalis, G Vezzani, D Rao, CM Devin… - … on Machine Learning …, 2023 - openreview.net
The ability to leverage heterogeneous robotic experience from different robots and tasks to
quickly master novel skills and embodiments has the potential to transform robot learning …

Towards mental time travel: a hierarchical memory for reinforcement learning agents

A Lampinen, S Chan, A Banino… - Advances in Neural …, 2021 - proceedings.neurips.cc
Reinforcement learning agents often forget details of the past, especially after delays or
distractor tasks. Agents with common memory architectures struggle to recall and integrate …

Recurrent hypernetworks are surprisingly strong in meta-RL

J Beck, R Vuorio, Z Xiong… - Advances in Neural …, 2024 - proceedings.neurips.cc
Deep reinforcement learning (RL) is notoriously impractical to deploy due to sample
inefficiency. Meta-RL directly addresses this sample inefficiency by learning to perform few …

Visual language navigation: A survey and open challenges

SM Park, YG Kim - Artificial Intelligence Review, 2023 - Springer
With the recent development of deep learning, AI models are widely used in various
domains. AI models show good performance for definite tasks such as image classification …

Tabr: Unlocking the power of retrieval-augmented tabular deep learning

Y Gorishniy, I Rubachev, N Kartashev… - arXiv preprint arXiv …, 2023 - arxiv.org
Deep learning (DL) models for tabular data problems are receiving increasingly more
attention, while the algorithms based on gradient-boosted decision trees (GBDT) remain a …