Hyperparameters in reinforcement learning and how to tune them

T Eimer, M Lindauer… - … Conference on Machine …, 2023 - proceedings.mlr.press
In order to improve reproducibility, deep reinforcement learning (RL) has been adopting
better scientific practices such as standardized evaluation metrics and reporting. However …

Bayesian optimization over discrete and mixed spaces via probabilistic reparameterization

S Daulton, X Wan, D Eriksson… - Advances in …, 2022 - proceedings.neurips.cc
Optimizing expensive-to-evaluate black-box functions of discrete (and potentially
continuous) design parameters is a ubiquitous problem in scientific and engineering …

Automl in the age of large language models: Current challenges, future opportunities and risks

A Tornede, D Deng, T Eimer, J Giovanelli… - arXiv preprint arXiv …, 2023 - arxiv.org
The fields of both Natural Language Processing (NLP) and Automated Machine Learning
(AutoML) have achieved remarkable results over the past years. In NLP, especially Large …

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Dexpbt: Scaling up dexterous manipulation for hand-arm systems with population based training

A Petrenko, A Allshire, G State, A Handa… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we propose algorithms and methods that enable learning dexterous object
manipulation using simulated one-or two-armed robots equipped with multi-fingered hand …

Multi-objective population based training

A Dushatskiy, A Chebykin… - International …, 2023 - proceedings.mlr.press
Abstract Population Based Training (PBT) is an efficient hyperparameter optimization
algorithm. PBT is a single-objective algorithm, but many real-world hyperparameter …

Autorl hyperparameter landscapes

A Mohan, C Benjamins, K Wienecke… - arXiv preprint arXiv …, 2023 - arxiv.org
Although Reinforcement Learning (RL) has shown to be capable of producing impressive
results, its use is limited by the impact of its hyperparameters on performance. This often …

Structure in Deep Reinforcement Learning: A Survey and Open Problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Bayesian optimisation of functions on graphs

X Wan, P Osselin, H Kenlay, B Ru… - Advances in …, 2023 - proceedings.neurips.cc
The increasing availability of graph-structured data motivates the task of optimising over
functions defined on the node set of graphs. Traditional graph search algorithms can be …

Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning

H Bai, R Cheng - IEEE Transactions on Emerging Topics in …, 2024 - ieeexplore.ieee.org
Hyperparameter optimization plays a key role in the machine learning domain. Its
significance is especially pronounced in reinforcement learning (RL), where agents …