Authors
Gen Li, Yuting Wei, Yuejie Chi, Yuxin Chen
Publication date
2024/1
Journal
Operations Research
Volume
72
Issue
1
Pages
203-221
Publisher
INFORMS
Description
This paper is concerned with the sample efficiency of reinforcement learning, assuming access to a generative model (or simulator). We first consider γ-discounted infinite-horizon Markov decision processes (MDPs) with state space S and action space A. Despite a number of prior works tackling this problem, a complete picture of the trade-offs between sample complexity and statistical accuracy has yet to be determined. In particular, all prior results suffer from a severe sample size barrier, in the sense that their claimed statistical guarantees hold only when the sample size exceeds at least |S||A|/(1−γ)². The current paper overcomes this barrier by certifying the minimax optimality of two algorithms—a perturbed model-based algorithm and a conservative model-based algorithm—as soon as the sample size exceeds the order of |S||A|/(1−γ) (modulo some log factor). Moving beyond infinite-horizon MDPs, we further study time …
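The model-based (plug-in) approach underlying both certified algorithms can be sketched in a few lines: draw N independent next-state samples per state-action pair from the generative model, form the empirical transition kernel, and plan (e.g., by value iteration) on the empirical MDP. The sketch below is illustrative only, assuming a hypothetical sampler `sample_next_state(s, a)`; it omits the paper's perturbation/conservatism refinements.

```python
import numpy as np

def empirical_mdp_planning(sample_next_state, S, A, r, gamma, N, iters=500):
    """Plug-in planning with a generative model (illustrative sketch).

    sample_next_state(s, a) -> one draw from the true kernel P(. | s, a).
    Builds the empirical kernel P_hat from N samples per (s, a), then runs
    value iteration on the empirical MDP to return a greedy policy and Q.
    """
    P_hat = np.zeros((S, A, S))
    for s in range(S):
        for a in range(A):
            for _ in range(N):
                P_hat[s, a, sample_next_state(s, a)] += 1.0 / N
    Q = np.zeros((S, A))
    for _ in range(iters):
        V = Q.max(axis=1)            # greedy value of current Q
        Q = r + gamma * (P_hat @ V)  # Bellman update on the empirical MDP
    return Q.argmax(axis=1), Q
```

The paper's contribution is a statistical guarantee for (perturbed/conservative variants of) exactly this pipeline: the planned policy is ε-optimal once the total sample budget N·|S||A| reaches the minimax-optimal order, rather than the larger barrier required by prior analyses.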
Total citations
(per-year citation chart, 2019–2024; counts garbled in extraction)