Measuring sample efficiency and generalization in reinforcement learning benchmarks: Neurips...

P Agarwal, AA Rahman, PL St-Charles… - arXiv preprint arXiv …, 2023 - arxiv.org

Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …

被引用次数：17 相关文章所有 2 个版本

[PDF] mlr.press

Minerl diamond 2021 competition: Overview, results, and lessons learned

A Kanervisto, S Milani… - NeurIPS 2021 …, 2022 - proceedings.mlr.press

Reinforcement learning competitions advance the field by providing appropriate scope and
support to develop solutions toward a specific problem. To promote the development of …

被引用次数：28 相关文章所有 5 个版本

[PDF] acm.org

Autofl: Enabling heterogeneity-aware energy efficient federated learning

YG Kim, CJ Wu - MICRO-54: 54th Annual IEEE/ACM International …, 2021 - dl.acm.org

Federated learning enables a cluster of decentralized mobile devices at the edge to
collaboratively train a shared machine learning model, while keeping all the raw training …

被引用次数：93 相关文章所有 5 个版本

[PDF] neurips.cc

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc

Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

被引用次数：19 相关文章所有 5 个版本

[PDF] neurips.cc

Interpretable and explainable logical policies via neurally guided symbolic abstraction

Q Delfosse, H Shindo, D Dhami… - Advances in Neural …, 2024 - proceedings.neurips.cc

The limited priors required by neural networks make them the dominating choice to encode
and learn policies using reinforcement learning (RL). However, they are also black-boxes …

被引用次数：23 相关文章所有 5 个版本

[PDF] neurips.cc

Heuristic-guided reinforcement learning

CA Cheng, A Kolobov… - Advances in Neural …, 2021 - proceedings.neurips.cc

We provide a framework to accelerate reinforcement learning (RL) algorithms by heuristics
that are constructed by domain knowledge or offline data. Tabula rasa RL algorithms require …

被引用次数：70 相关文章所有 7 个版本

[PDF] ieee.org

Rlops: Development life-cycle of reinforcement learning aided open ran

P Li, J Thomas, X Wang, A Khalil, A Ahmad… - IEEE …, 2022 - ieeexplore.ieee.org

Radio access network (RAN) technologies continue to evolve, with Open RAN gaining the
most recent momentum. In the O-RAN specifications, the RAN intelligent controllers (RICs) …

被引用次数：40 相关文章所有 6 个版本

[PDF] arxiv.org

Procedural generalization by planning with self-supervised world models

A Anand, J Walker, Y Li, E Vértes… - arXiv preprint arXiv …, 2021 - arxiv.org

One of the key promises of model-based reinforcement learning is the ability to generalize
using an internal model of the world to make predictions in novel environments and tasks …

被引用次数：37 相关文章所有 3 个版本

[PDF] arxiv.org

Hyperbolic deep reinforcement learning

E Cetin, B Chamberlain, M Bronstein… - arXiv preprint arXiv …, 2022 - arxiv.org

We propose a new class of deep reinforcement learning (RL) algorithms that model latent
representations in hyperbolic space. Sequential decision-making requires reasoning about …

被引用次数：24 相关文章所有 4 个版本

[PDF] arxiv.org

Cross-trajectory representation learning for zero-shot generalization in RL

B Mazoure, AM Ahmed, P MacAlpine, RD Hjelm… - arXiv preprint arXiv …, 2021 - arxiv.org

A highly desirable property of a reinforcement learning (RL) agent--and a major difficulty for
deep RL approaches--is the ability to generalize policies learned on a few tasks over a high …

被引用次数：29 相关文章所有 3 个版本

高级搜索

QQ 群