Non-adaptive Online Finetuning for Offline Reinforcement Learning

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

Non-adaptive Online Finetuning for Offline Reinforcement Learning

在引用文章中搜索

[PDF] illinois.edu

[PDF][PDF] Offline reinforcement learning in large state spaces: Algorithms and guarantees

N Jiang, T Xie - Statistical Science, 2024 - nanjiang.cs.illinois.edu

This article introduces the theory of offline reinforcement learning in large state spaces,
where good policies are learned from historical data without online interactions with the …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

B Chan, A Leung, J Bergstra - arXiv preprint arXiv:2410.14957, 2024 - arxiv.org

Offline-to-online reinforcement learning (O2O RL) aims to obtain a continually improving
policy as it interacts with the environment, while ensuring the initial policy behaviour is …

高级搜索

QQ 群

Non-adaptive Online Finetuning for Offline Reinforcement Learning

[PDF][PDF] Offline reinforcement learning in large state spaces: Algorithms and guarantees

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

引用