Bayesian generational population-based training

T Eimer, M Lindauer… - … Conference on Machine …, 2023 - proceedings.mlr.press

In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

被引用次数：28 相关文章所有 7 个版本

[PDF] neurips.cc

Bayesian optimization over discrete and mixed spaces via probabilistic reparameterization

S Daulton, X Wan, D Eriksson… - Advances in …, 2022 - proceedings.neurips.cc

Optimizing expensive-to-evaluate black-box functions of discrete (and potentially
continuous) design parameters is a ubiquitous problem in scientific and engineering …

被引用次数：37 相关文章所有 8 个版本

[PDF] arxiv.org

Automl in the age of large language models: Current challenges, future opportunities and risks

A Tornede, D Deng, T Eimer, J Giovanelli… - arXiv preprint arXiv …, 2023 - arxiv.org

The fields of both Natural Language Processing (NLP) and Automated Machine Learning
(AutoML) have achieved remarkable results over the past years. In NLP, especially Large …

被引用次数：24 相关文章所有 6 个版本

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

被引用次数：11 相关文章所有 2 个版本

[PDF] arxiv.org

Dexpbt: Scaling up dexterous manipulation for hand-arm systems with population based training

A Petrenko, A Allshire, G State, A Handa… - arXiv preprint arXiv …, 2023 - arxiv.org

In this work, we propose algorithms and methods that enable learning dexterous object
manipulation using simulated one-or two-armed robots equipped with multi-fingered hand …

被引用次数：10 相关文章所有 2 个版本

[PDF] mlr.press

Multi-objective population based training

A Dushatskiy, A Chebykin… - International …, 2023 - proceedings.mlr.press

Abstract Population Based Training (PBT) is an efficient hyperparameter optimization
algorithm. PBT is a single-objective algorithm, but many real-world hyperparameter …

被引用次数：3 相关文章所有 7 个版本

[PDF] arxiv.org

Autorl hyperparameter landscapes

A Mohan, C Benjamins, K Wienecke… - arXiv preprint arXiv …, 2023 - arxiv.org

Although Reinforcement Learning (RL) has shown to be capable of producing impressive
results, its use is limited by the impact of its hyperparameters on performance. This often …

被引用次数：10 相关文章所有 9 个版本

[PDF] jair.org Full View

Structure in Deep Reinforcement Learning: A Survey and Open Problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Bayesian optimisation of functions on graphs

X Wan, P Osselin, H Kenlay, B Ru… - Advances in …, 2023 - proceedings.neurips.cc

The increasing availability of graph-structured data motivates the task of optimising over
functions defined on the node set of graphs. Traditional graph search algorithms can be …

被引用次数：1 相关文章所有 7 个版本

[PDF] arxiv.org

Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning

H Bai, R Cheng - IEEE Transactions on Emerging Topics in …, 2024 - ieeexplore.ieee.org

Hyperparameter optimization plays a key role in the machine learning domain. Its
significance is especially pronounced in reinforcement learning (RL), where agents …

被引用次数：1 相关文章所有 3 个版本

高级搜索

QQ 群