Transformers in reinforcement learning: a survey

P Agarwal, AA Rahman, PL St-Charles… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …

Minerl diamond 2021 competition: Overview, results, and lessons learned

A Kanervisto, S Milani… - NeurIPS 2021 …, 2022 - proceedings.mlr.press
Reinforcement learning competitions advance the field by providing appropriate scope and
support to develop solutions toward a specific problem. To promote the development of …

Autofl: Enabling heterogeneity-aware energy efficient federated learning

YG Kim, CJ Wu - MICRO-54: 54th Annual IEEE/ACM International …, 2021 - dl.acm.org
Federated learning enables a cluster of decentralized mobile devices at the edge to
collaboratively train a shared machine learning model, while keeping all the raw training …

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc
Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

Interpretable and explainable logical policies via neurally guided symbolic abstraction

Q Delfosse, H Shindo, D Dhami… - Advances in Neural …, 2024 - proceedings.neurips.cc
The limited priors required by neural networks make them the dominating choice to encode
and learn policies using reinforcement learning (RL). However, they are also black-boxes …

Heuristic-guided reinforcement learning

CA Cheng, A Kolobov… - Advances in Neural …, 2021 - proceedings.neurips.cc
We provide a framework to accelerate reinforcement learning (RL) algorithms by heuristics
that are constructed by domain knowledge or offline data. Tabula rasa RL algorithms require …

Rlops: Development life-cycle of reinforcement learning aided open ran

P Li, J Thomas, X Wang, A Khalil, A Ahmad… - IEEE …, 2022 - ieeexplore.ieee.org
Radio access network (RAN) technologies continue to evolve, with Open RAN gaining the
most recent momentum. In the O-RAN specifications, the RAN intelligent controllers (RICs) …

Procedural generalization by planning with self-supervised world models

A Anand, J Walker, Y Li, E Vértes… - arXiv preprint arXiv …, 2021 - arxiv.org
One of the key promises of model-based reinforcement learning is the ability to generalize
using an internal model of the world to make predictions in novel environments and tasks …

Hyperbolic deep reinforcement learning

E Cetin, B Chamberlain, M Bronstein… - arXiv preprint arXiv …, 2022 - arxiv.org
We propose a new class of deep reinforcement learning (RL) algorithms that model latent
representations in hyperbolic space. Sequential decision-making requires reasoning about …

Cross-trajectory representation learning for zero-shot generalization in RL

B Mazoure, AM Ahmed, P MacAlpine, RD Hjelm… - arXiv preprint arXiv …, 2021 - arxiv.org
A highly desirable property of a reinforcement learning (RL) agent--and a major difficulty for
deep RL approaches--is the ability to generalize policies learned on a few tasks over a high …