Reward-free exploration beyond finite-horizon

文章

学术资源搜索

获得 1 条结果（用时0.02秒）

我的图书馆

Reward-free exploration beyond finite-horizon

在引用文章中搜索

[PDF] neurips.cc

Provably efficient reward-agnostic navigation with linear value iteration

A Zanette, A Lazaric… - Advances in Neural …, 2020 - proceedings.neurips.cc

There has been growing progress on theoretical analyses for provably efficient learning in
MDPs with linear function approximation, but much of the existing work has made strong …

被引用次数：68 相关文章所有 5 个版本

高级搜索

QQ 群

Reward-free exploration beyond finite-horizon

Provably efficient reward-agnostic navigation with linear value iteration

引用