Exploration in model-based reinforcement learning by empirically estimating learning progress

M Lopes, T Lang, M Toussaint… - Advances in neural …, 2012 - proceedings.neurips.cc
Formal exploration approaches in model-based reinforcement learning estimate the
accuracy of the currently learned model without consideration of the empirical prediction …

Explicit explore-exploit algorithms in continuous state spaces

M Henaff - Advances in Neural Information Processing …, 2019 - proceedings.neurips.cc
We present a new model-based algorithm for reinforcement learning (RL) which consists of
explicit exploration and exploitation phases, and is applicable in large or infinite state …

[PDF][PDF] Model-based reinforcement learning with an approximate, learned model

L Kuvayev, RS Sutton - Proceedings of the ninth Yale workshop on …, 1996 - Citeseer
Abstract Model-based reinforcement learning, in which a model of the environment's
dynamics is learned and used to supplement direct learning from experience, has been …

A Bayesian sampling approach to exploration in reinforcement learning

J Asmuth, L Li, ML Littman, A Nouri… - arXiv preprint arXiv …, 2012 - arxiv.org
We present a modular approach to reinforcement learning that uses a Bayesian
representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set) …

Using inaccurate models in reinforcement learning

P Abbeel, M Quigley, AY Ng - … of the 23rd international conference on …, 2006 - dl.acm.org
In the model-based policy search approach to reinforcement learning (RL), policies are
found using a model (or" simulator") of the Markov decision process. However, for high …

[PDF][PDF] Potential-based Shaping in Model-based Reinforcement Learning.

J Asmuth, ML Littman, R Zinkov - AAAI, 2008 - cdn.aaai.org
Potential-based shaping was designed as a way of introducing background knowledge into
model-free reinforcement-learning algorithms. By identifying states that are likely to have …

Model-based value estimation for efficient model-free reinforcement learning

V Feinberg, A Wan, I Stoica, MI Jordan… - arXiv preprint arXiv …, 2018 - arxiv.org
Recent model-free reinforcement learning algorithms have proposed incorporating learned
dynamics models as a source of additional data with the intention of reducing sample …

Integrating sample-based planning and model-based reinforcement learning

T Walsh, S Goschin, M Littman - Proceedings of the aaai conference on …, 2010 - ojs.aaai.org
Recent advancements in model-based reinforcement learning have shown that the
dynamics of many structured domains (eg DBNs) can be learned with tractable sample …

Efficient exploration in continuous-time model-based reinforcement learning

L Treven, J Hübotter, F Dorfler… - Advances in Neural …, 2024 - proceedings.neurips.cc
Reinforcement learning algorithms typically consider discrete-time dynamics, even though
the underlying systems are often continuous in time. In this paper, we introduce a model …

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arXiv preprint arXiv …, 2021 - arxiv.org
In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …