Multi-objectivization and ensembles of shapings in reinforcement learning

T Brys, A Harutyunyan, P Vrancx, A Nowé, ME Taylor - Neurocomputing, 2017 - Elsevier
Ensemble techniques are a powerful approach to creating better decision makers in
machine learning. Multiple decision makers are trained to solve a given task, grouped in an …

Utilizing prior solutions for reward shaping and composition in entropy-regularized reinforcement learning

J Adamczyk, A Arriojas, S Tiomkin… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
In reinforcement learning (RL), the ability to utilize prior knowledge from previously solved
tasks can allow agents to quickly solve new problems. In some cases, these new problems …

Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity

A Gupta, A Pacchiano, Y Zhai… - Advances in Neural …, 2022 - proceedings.neurips.cc
The success of reinforcement learning in a variety of challenging sequential decision-
making problems has been much discussed, but often ignored in this discussion is the …

Effective diversity in population based reinforcement learning

J Parker-Holder, A Pacchiano… - Advances in …, 2020 - proceedings.neurips.cc
Exploration is a key problem in reinforcement learning, since agents can only learn from
data they acquire in the environment. With that in mind, maintaining a population of agents is …

Exploration-guided reward shaping for reinforcement learning under sparse rewards

R Devidze, P Kamalaruban… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study the problem of reward shaping to accelerate the training process of a
reinforcement learning agent. Existing works have considered a number of different reward …

Discovering hierarchical achievements in reinforcement learning via contrastive learning

S Moon, J Yeom, B Park… - Advances in Neural …, 2024 - proceedings.neurips.cc
Discovering achievements with a hierarchical structure in procedurally generated
environments presents a significant challenge. This requires an agent to possess a broad …

Combining multiple correlated reward and shaping signals by measuring confidence

T Brys, A Nowé, D Kudenko, M Taylor - Proceedings of the AAAI …, 2014 - ojs.aaai.org
Multi-objective problems with correlated objectives are a class of problems that deserve
specific attention. In contrast to typical multi-objective problems, they do not require the …

[图书][B] Shaping and policy search in reinforcement learning

AY Ng - 2003 - search.proquest.com
To make reinforcement learning algorithms run in a reasonable amount of time, it is
frequently necessary to use a well-chosen reward function that gives appropriate “hints” to …

A Bayesian sampling approach to exploration in reinforcement learning

J Asmuth, L Li, ML Littman, A Nouri… - arXiv preprint arXiv …, 2012 - arxiv.org
We present a modular approach to reinforcement learning that uses a Bayesian
representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set) …

Iteratively learn diverse strategies with state distance information

W Fu, W Du, J Li, S Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
In complex reinforcement learning (RL) problems, policies with similar rewards may have
substantially different behaviors. It remains a fundamental challenge to optimize rewards …