Elastic decision transformer

YH Wu, X Wang, M Hamaya - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract This paper introduces Elastic Decision Transformer (EDT), a significant
advancement over the existing Decision Transformer (DT) and its variants. Although DT …

Waypoint transformer: Reinforcement learning via supervised learning with intermediate targets

A Badrinath, Y Flet-Berliac, A Nie… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite the recent advancements in offline reinforcement learning via supervised learning
(RvS) and the success of the decision transformer (DT) architecture in various domains, DTs …

On Transforming Reinforcement Learning With Transformers: The Development Trajectory

S Hu, L Shen, Y Zhang, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …

Contrastive decision transformers

SG Konan, E Seraj… - Conference on Robot …, 2023 - proceedings.mlr.press
Decision Transformers (DT) have drawn upon the success of Transformers by abstracting
Reinforcement Learning as a target-return-conditioned, sequence modeling problem. In our …

Generalized decision transformer for offline hindsight information matching

H Furuta, Y Matsuo, SS Gu - arXiv preprint arXiv:2111.10364, 2021 - arxiv.org
How to extract as much learning signal from each trajectory data has been a key problem in
reinforcement learning (RL), where sample inefficiency has posed serious challenges for …

You can't count on luck: Why decision transformers and rvs fail in stochastic environments

K Paster, S McIlraith, J Ba - Advances in neural information …, 2022 - proceedings.neurips.cc
Recently, methods such as Decision Transformer that reduce reinforcement learning to a
prediction task and solve it via supervised learning (RvS) have become popular due to their …

Learn what not to learn: Action elimination with deep reinforcement learning

T Zahavy, M Haroush, N Merlis… - Advances in neural …, 2018 - proceedings.neurips.cc
Learning how to act when there are many available actions in each state is a challenging
task for Reinforcement Learning (RL) agents, especially when many of the actions are …

Emergent agentic transformer from chain of hindsight experience

H Liu, P Abbeel - International Conference on Machine …, 2023 - proceedings.mlr.press
Large transformer models powered by diverse data and model scale have dominated
natural language modeling and computer vision and pushed the frontier of multiple AI areas …

Adarl: What, where, and how to adapt in transfer reinforcement learning

B Huang, F Feng, C Lu, S Magliacane… - arXiv preprint arXiv …, 2021 - arxiv.org
One practical challenge in reinforcement learning (RL) is how to make quick adaptations
when faced with new environments. In this paper, we propose a principled framework for …

Discrete and continuous action representation for practical rl in video games

O Delalleau, M Peter, E Alonso, A Logut - arXiv preprint arXiv:1912.11077, 2019 - arxiv.org
While most current research in Reinforcement Learning (RL) focuses on improving the
performance of the algorithms in controlled environments, the use of RL under constraints …