Conservative bayesian model-based value expansion for offline policy optimization

Y Sun, J Zhang, C Jia, H Lin, J Ye… - … Conference on Machine …, 2023 - proceedings.mlr.press

For offline reinforcement learning (RL), model-based methods are expected to be data-
efficient as they incorporate dynamics models to generate more data. However, due to …

被引用次数：18 相关文章所有 4 个版本

[PDF] arxiv.org

How to learn and generalize from three minutes of data: Physics-constrained and uncertainty-aware neural stochastic differential equations

F Djeumou, C Neary, U Topcu - arXiv preprint arXiv:2306.06335, 2023 - arxiv.org

We present a framework and algorithms to learn controlled dynamics models using neural
stochastic differential equations (SDEs)--SDEs whose drift and diffusion terms are both …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

H Lin, YY Xu, Y Sun, Z Zhang, YC Li, C Jia, J Ye… - arXiv preprint arXiv …, 2024 - arxiv.org

Model-based methods in reinforcement learning offer a promising approach to enhance
data efficiency by facilitating policy exploration within a dynamics model. However …

相关文章所有 2 个版本

[PDF] arxiv.org

Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning

M Nakhaei, A Scannell, J Pajarinen - arXiv preprint arXiv:2406.08238, 2024 - arxiv.org

Offline reinforcement learning (RL) allows learning sequential behavior from fixed datasets.
Since offline datasets do not cover all possible situations, many methods collect additional …

相关文章所有 2 个版本

[PDF] arxiv.org

Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning

A Akgül, M Haußmann, M Kandemir - arXiv preprint arXiv:2406.04088, 2024 - arxiv.org

Current approaches to model-based offline Reinforcement Learning (RL) often incorporate
uncertainty-based reward penalization to address the distributional shift problem. While …

相关文章所有 2 个版本

[PDF] arxiv.org

Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization

CE Luis, AG Bottero, J Vinogradska… - arXiv preprint arXiv …, 2023 - arxiv.org

We consider the problem of quantifying uncertainty over expected cumulative rewards in
model-based reinforcement learning. In particular, we focus on characterizing the variance …

相关文章所有 2 个版本

[PDF] utoronto.ca

Who Should I Trust?: Uncertainty and Risk for Knowledge Transfer from Multiple Sources in Reinforcement Learning Domains

M Gimelfarb - 2023 - search.proquest.com

Despite the recent success of reinforcement learning (RL) in simulated domains and
industrial applications, sample-efficiency remains a fundamental limitation of many model …

相关文章所有 2 个版本

[PDF] utexas.edu

Learning for autonomy in the wild: theory, algorithms, and practice

F Djeumou - 2023 - repositories.lib.utexas.edu

How can autonomous systems learn to operate in the wild, ie, complex, dynamic, and
uncertain real-world environments? Despite recent and significant breakthroughs in artificial …

相关文章所有 2 个版本

高级搜索

QQ 群