Articles subject to mandatory open access policies - Dhruv Madeka
Articles publicly available elsewhere: 1
A few expert queries suffices for sample-efficient RL with resets and linear value approximation
P Amortila, N Jiang, D Madeka, DP Foster
Advances in Neural Information Processing Systems 35, 29637-29648, 2022
Mandatory open access policy: Natural Sciences and Engineering Research Council of Canada
Publication and funding information is determined automatically by a computer program