All versions - Academic Resource Search

2 results (0.02 seconds)

Quantile off-policy evaluation via deep conditional generative learning

Y Xu, C Shi, S Luo, L Wang, R Song - arXiv preprint arXiv:2212.14466, 2022 - arxiv.org

Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline
data generated by a potentially different behavior policy. It is critical in a number of …

Cited by 5

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Y Xu, C Shi, S Luo, L Wang, R Song - arXiv e-prints, 2022 - ui.adsabs.harvard.edu

Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline
data generated by a potentially different behavior policy. It is critical in a number of …
