相关文章- 学术资源搜索

Elastic decision transformer

YH Wu, X Wang, M Hamaya - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract This paper introduces Elastic Decision Transformer (EDT), a significant
advancement over the existing Decision Transformer (DT) and its variants. Although DT …

被引用次数：17 相关文章所有 7 个版本

[PDF] neurips.cc

Waypoint transformer: Reinforcement learning via supervised learning with intermediate targets

A Badrinath, Y Flet-Berliac, A Nie… - Advances in Neural …, 2024 - proceedings.neurips.cc

Despite the recent advancements in offline reinforcement learning via supervised learning
(RvS) and the success of the decision transformer (DT) architecture in various domains, DTs …

被引用次数：9 相关文章所有 7 个版本

[PDF] arxiv.org

On Transforming Reinforcement Learning With Transformers: The Development Trajectory

S Hu, L Shen, Y Zhang, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …

被引用次数：14 相关文章所有 5 个版本

[PDF] mlr.press

Contrastive decision transformers

SG Konan, E Seraj… - Conference on Robot …, 2023 - proceedings.mlr.press

Decision Transformers (DT) have drawn upon the success of Transformers by abstracting
Reinforcement Learning as a target-return-conditioned, sequence modeling problem. In our …

被引用次数：15 相关文章所有 2 个版本

[PDF] arxiv.org

Generalized decision transformer for offline hindsight information matching

H Furuta, Y Matsuo, SS Gu - arXiv preprint arXiv:2111.10364, 2021 - arxiv.org

How to extract as much learning signal from each trajectory data has been a key problem in
reinforcement learning (RL), where sample inefficiency has posed serious challenges for …

被引用次数：82 相关文章所有 5 个版本

[PDF] neurips.cc

You can't count on luck: Why decision transformers and rvs fail in stochastic environments

K Paster, S McIlraith, J Ba - Advances in neural information …, 2022 - proceedings.neurips.cc

Recently, methods such as Decision Transformer that reduce reinforcement learning to a
prediction task and solve it via supervised learning (RvS) have become popular due to their …

被引用次数：49 相关文章所有 6 个版本

[PDF] neurips.cc

Learn what not to learn: Action elimination with deep reinforcement learning

T Zahavy, M Haroush, N Merlis… - Advances in neural …, 2018 - proceedings.neurips.cc

Learning how to act when there are many available actions in each state is a challenging
task for Reinforcement Learning (RL) agents, especially when many of the actions are …

被引用次数：227 相关文章所有 10 个版本

[PDF] mlr.press

Emergent agentic transformer from chain of hindsight experience

H Liu, P Abbeel - International Conference on Machine …, 2023 - proceedings.mlr.press

Large transformer models powered by diverse data and model scale have dominated
natural language modeling and computer vision and pushed the frontier of multiple AI areas …

被引用次数：14 相关文章所有 6 个版本

[PDF] arxiv.org

Adarl: What, where, and how to adapt in transfer reinforcement learning

B Huang, F Feng, C Lu, S Magliacane… - arXiv preprint arXiv …, 2021 - arxiv.org

One practical challenge in reinforcement learning (RL) is how to make quick adaptations
when faced with new environments. In this paper, we propose a principled framework for …

被引用次数：55 相关文章所有 5 个版本

[PDF] arxiv.org

Discrete and continuous action representation for practical rl in video games

O Delalleau, M Peter, E Alonso, A Logut - arXiv preprint arXiv:1912.11077, 2019 - arxiv.org

While most current research in Reinforcement Learning (RL) focuses on improving the
performance of the algorithms in controlled environments, the use of RL under constraints …

被引用次数：64 相关文章所有 9 个版本

高级搜索

QQ 群

Elastic decision transformer

Waypoint transformer: Reinforcement learning via supervised learning with intermediate targets

On Transforming Reinforcement Learning With Transformers: The Development Trajectory

Contrastive decision transformers

Generalized decision transformer for offline hindsight information matching

You can't count on luck: Why decision transformers and rvs fail in stochastic environments

Learn what not to learn: Action elimination with deep reinforcement learning

Emergent agentic transformer from chain of hindsight experience

Adarl: What, where, and how to adapt in transfer reinforcement learning

Discrete and continuous action representation for practical rl in video games

相关搜索

引用