相关文章- 学术资源搜索

Multi-objectivization and ensembles of shapings in reinforcement learning

T Brys, A Harutyunyan, P Vrancx, A Nowé, ME Taylor - Neurocomputing, 2017 - Elsevier

Ensemble techniques are a powerful approach to creating better decision makers in
machine learning. Multiple decision makers are trained to solve a given task, grouped in an …

被引用次数：48 相关文章所有 5 个版本

[PDF] aaai.org

Utilizing prior solutions for reward shaping and composition in entropy-regularized reinforcement learning

J Adamczyk, A Arriojas, S Tiomkin… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

In reinforcement learning (RL), the ability to utilize prior knowledge from previously solved
tasks can allow agents to quickly solve new problems. In some cases, these new problems …

被引用次数：4 相关文章所有 8 个版本

[PDF] neurips.cc

Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity

A Gupta, A Pacchiano, Y Zhai… - Advances in Neural …, 2022 - proceedings.neurips.cc

The success of reinforcement learning in a variety of challenging sequential decision-
making problems has been much discussed, but often ignored in this discussion is the …

被引用次数：39 相关文章所有 8 个版本

[PDF] neurips.cc

Effective diversity in population based reinforcement learning

J Parker-Holder, A Pacchiano… - Advances in …, 2020 - proceedings.neurips.cc

Exploration is a key problem in reinforcement learning, since agents can only learn from
data they acquire in the environment. With that in mind, maintaining a population of agents is …

被引用次数：154 相关文章所有 8 个版本

[PDF] neurips.cc

Exploration-guided reward shaping for reinforcement learning under sparse rewards

R Devidze, P Kamalaruban… - Advances in Neural …, 2022 - proceedings.neurips.cc

We study the problem of reward shaping to accelerate the training process of a
reinforcement learning agent. Existing works have considered a number of different reward …

被引用次数：20 相关文章所有 8 个版本

[PDF] neurips.cc

Discovering hierarchical achievements in reinforcement learning via contrastive learning

S Moon, J Yeom, B Park… - Advances in Neural …, 2024 - proceedings.neurips.cc

Discovering achievements with a hierarchical structure in procedurally generated
environments presents a significant challenge. This requires an agent to possess a broad …

被引用次数：2 相关文章所有 6 个版本

[PDF] aaai.org

Combining multiple correlated reward and shaping signals by measuring confidence

T Brys, A Nowé, D Kudenko, M Taylor - Proceedings of the AAAI …, 2014 - ojs.aaai.org

Multi-objective problems with correlated objectives are a class of problems that deserve
specific attention. In contrast to typical multi-objective problems, they do not require the …

被引用次数：41 相关文章所有 17 个版本

[PDF] berkeley.edu

[图书][B] Shaping and policy search in reinforcement learning

AY Ng - 2003 - search.proquest.com

To make reinforcement learning algorithms run in a reasonable amount of time, it is
frequently necessary to use a well-chosen reward function that gives appropriate “hints” to …

被引用次数：154 相关文章所有 7 个版本

[PDF] arxiv.org

A Bayesian sampling approach to exploration in reinforcement learning

J Asmuth, L Li, ML Littman, A Nouri… - arXiv preprint arXiv …, 2012 - arxiv.org

We present a modular approach to reinforcement learning that uses a Bayesian
representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set) …

被引用次数：202 相关文章所有 20 个版本

[PDF] neurips.cc

Iteratively learn diverse strategies with state distance information

W Fu, W Du, J Li, S Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc

In complex reinforcement learning (RL) problems, policies with similar rewards may have
substantially different behaviors. It remains a fundamental challenge to optimize rewards …

被引用次数：2 相关文章所有 9 个版本

高级搜索

QQ 群

Multi-objectivization and ensembles of shapings in reinforcement learning

Utilizing prior solutions for reward shaping and composition in entropy-regularized reinforcement learning

Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity

Effective diversity in population based reinforcement learning

Exploration-guided reward shaping for reinforcement learning under sparse rewards

Discovering hierarchical achievements in reinforcement learning via contrastive learning

Combining multiple correlated reward and shaping signals by measuring confidence

[图书][B] Shaping and policy search in reinforcement learning

A Bayesian sampling approach to exploration in reinforcement learning

Iteratively learn diverse strategies with state distance information

相关搜索

引用