evaluation in reinforcement learning- 学术资源搜索

Evaluation of reinforcement learning techniques

AK Yadav, SK Shrivastava - … of the first international conference on …, 2010 - dl.acm.org

… In this paper we proposed Q-learning algorithm and evaluation of RL techniques (Reinforcement
learning architecture, … Learning agent, the fundamental element of reinforcement …

被引用次数：9 相关文章

[PDF] mlr.press

Data-efficient off-policy policy evaluation for reinforcement learning

P Thomas, E Brunskill - … Conference on Machine Learning, 2016 - proceedings.mlr.press

In this paper we present a new way of predicting the performance of a reinforcement learning
policy given historical data that may have been generated by a different policy. The ability …

被引用次数：743 相关文章所有 14 个版本

[PDF] rutgers.edu

[图书][B] Improved empirical methods in reinforcement-learning evaluation

VN Marivate - 2015 - search.proquest.com

… Further interrogation of the state-action value function will be made later when I cover multiple
approaches to evaluating reinforcement-learning algorithms with batch data. …

被引用次数：10 相关文章所有 7 个版本

[PDF] mlr.press

Doubly robust off-policy value evaluation for reinforcement learning

N Jiang, L Li - International conference on machine learning, 2016 - proceedings.mlr.press

We study the problem of off-policy value evaluation in reinforcement learning (RL), where
one aims to estimate the value of a new policy based on data collected by a different policy. …

被引用次数：836 相关文章所有 10 个版本

[PDF] neurips.cc

The impact of task underspecification in evaluating deep reinforcement learning

V Jayawardana, C Tang, S Li… - Advances in Neural …, 2022 - proceedings.neurips.cc

… DRL methods on select MDP instances, evaluating the … evaluating on an MDP family is
nontrivial. Overall, this work identifies new challenges for empirical rigor in reinforcement learning…

被引用次数：15 相关文章所有 8 个版本

[PDF] aaai.org

Deep reinforcement learning that matters

P Henderson, R Islam, P Bachman, J Pineau… - Proceedings of the …, 2018 - ojs.aaai.org

… this section we analyze some of the evaluation metrics commonly used in the reinforcement
… We focus on evaluation methods for the policy optimization view (with offline evaluation), but …

被引用次数：2445 相关文章所有 19 个版本

[PDF] arxiv.org

A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arXiv preprint arXiv:2212.06355, 2022 - arxiv.org

… Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning … In this paper, we primarily focus on off-policy evaluation (OPE), one of the most …

被引用次数：55 相关文章所有 2 个版本

[PDF] openreview.net

Re-evaluate: Reproducibility in evaluating reinforcement learning algorithms

K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau - 2018 - openreview.net

… in evaluation in RL compared to supervised learning, … evaluation in RL, and propose an
evaluation pipeline that can be decoupled from the algorithm code. We hope such an evaluation …

被引用次数：22 相关文章

[PDF] springer.com

Empirical evaluation methods for multiobjective reinforcement learning algorithms

P Vamplew, R Dazeley, A Berry, R Issabekov… - Machine learning, 2011 - Springer

… empirical evaluation, to act as a foundation for future comparative studies. Two classes of
multiobjective reinforcement learning algorithms are identified, and appropriate evaluation …

被引用次数：370 相关文章所有 16 个版本

[PDF] arxiv.org

Empirical study of off-policy policy evaluation for reinforcement learning

C Voloshin, HM Le, N Jiang, Y Yue - arXiv preprint arXiv:1911.06854, 2019 - arxiv.org

… learning and deep reinforcement learning efforts [23, 35]. As OPE is central to real-world
applications of reinforcement learning… work on OPE evaluation for reinforcement learning [17, 18…

被引用次数：148 相关文章所有 11 个版本

高级搜索

QQ 群