Model-Bellman inconsistency for model-based offline reinforcement learning

Y Sun, J Zhang, C Jia, H Lin, J Ye… - … Conference on Machine …, 2023 - proceedings.mlr.press
For offline reinforcement learning (RL), model-based methods are expected to be data-
efficient as they incorporate dynamics models to generate more data. However, due to …

How to learn and generalize from three minutes of data: Physics-constrained and uncertainty-aware neural stochastic differential equations

F Djeumou, C Neary, U Topcu - arXiv preprint arXiv:2306.06335, 2023 - arxiv.org
We present a framework and algorithms to learn controlled dynamics models using neural
stochastic differential equations (SDEs)--SDEs whose drift and diffusion terms are both …

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

H Lin, YY Xu, Y Sun, Z Zhang, YC Li, C Jia, J Ye… - arXiv preprint arXiv …, 2024 - arxiv.org
Model-based methods in reinforcement learning offer a promising approach to enhance
data efficiency by facilitating policy exploration within a dynamics model. However …

Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning

M Nakhaei, A Scannell, J Pajarinen - arXiv preprint arXiv:2406.08238, 2024 - arxiv.org
Offline reinforcement learning (RL) allows learning sequential behavior from fixed datasets.
Since offline datasets do not cover all possible situations, many methods collect additional …

Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning

A Akgül, M Haußmann, M Kandemir - arXiv preprint arXiv:2406.04088, 2024 - arxiv.org
Current approaches to model-based offline Reinforcement Learning (RL) often incorporate
uncertainty-based reward penalization to address the distributional shift problem. While …

Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization

CE Luis, AG Bottero, J Vinogradska… - arXiv preprint arXiv …, 2023 - arxiv.org
We consider the problem of quantifying uncertainty over expected cumulative rewards in
model-based reinforcement learning. In particular, we focus on characterizing the variance …

Who Should I Trust?: Uncertainty and Risk for Knowledge Transfer from Multiple Sources in Reinforcement Learning Domains

M Gimelfarb - 2023 - search.proquest.com
Despite the recent success of reinforcement learning (RL) in simulated domains and
industrial applications, sample-efficiency remains a fundamental limitation of many model …

Learning for autonomy in the wild: theory, algorithms, and practice

F Djeumou - 2023 - repositories.lib.utexas.edu
How can autonomous systems learn to operate in the wild, ie, complex, dynamic, and
uncertain real-world environments? Despite recent and significant breakthroughs in artificial …