Evaluation of reinforcement learning techniques

AK Yadav, SK Shrivastava - … of the first international conference on …, 2010 - dl.acm.org
… In this paper we proposed Q-learning algorithm and evaluation of RL techniques (Reinforcement
learning architecture, … Learning agent, the fundamental element of reinforcement

Data-efficient off-policy policy evaluation for reinforcement learning

P Thomas, E Brunskill - … Conference on Machine Learning, 2016 - proceedings.mlr.press
In this paper we present a new way of predicting the performance of a reinforcement learning
policy given historical data that may have been generated by a different policy. The ability …

[图书][B] Improved empirical methods in reinforcement-learning evaluation

VN Marivate - 2015 - search.proquest.com
… Further interrogation of the state-action value function will be made later when I cover multiple
approaches to evaluating reinforcement-learning algorithms with batch data. …

Doubly robust off-policy value evaluation for reinforcement learning

N Jiang, L Li - International conference on machine learning, 2016 - proceedings.mlr.press
We study the problem of off-policy value evaluation in reinforcement learning (RL), where
one aims to estimate the value of a new policy based on data collected by a different policy. …

The impact of task underspecification in evaluating deep reinforcement learning

V Jayawardana, C Tang, S Li… - Advances in Neural …, 2022 - proceedings.neurips.cc
… DRL methods on select MDP instances, evaluating the … evaluating on an MDP family is
nontrivial. Overall, this work identifies new challenges for empirical rigor in reinforcement learning

Deep reinforcement learning that matters

P Henderson, R Islam, P Bachman, J Pineau… - Proceedings of the …, 2018 - ojs.aaai.org
… this section we analyze some of the evaluation metrics commonly used in the reinforcement
… We focus on evaluation methods for the policy optimization view (with offline evaluation), but …

A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arXiv preprint arXiv:2212.06355, 2022 - arxiv.org
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning … In this paper, we primarily focus on off-policy evaluation (OPE), one of the most …

Re-evaluate: Reproducibility in evaluating reinforcement learning algorithms

K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau - 2018 - openreview.net
… in evaluation in RL compared to supervised learning, … evaluation in RL, and propose an
evaluation pipeline that can be decoupled from the algorithm code. We hope such an evaluation

Empirical evaluation methods for multiobjective reinforcement learning algorithms

P Vamplew, R Dazeley, A Berry, R Issabekov… - Machine learning, 2011 - Springer
… empirical evaluation, to act as a foundation for future comparative studies. Two classes of
multiobjective reinforcement learning algorithms are identified, and appropriate evaluation

Empirical study of off-policy policy evaluation for reinforcement learning

C Voloshin, HM Le, N Jiang, Y Yue - arXiv preprint arXiv:1911.06854, 2019 - arxiv.org
learning and deep reinforcement learning efforts [23, 35]. As OPE is central to real-world
applications of reinforcement learning… work on OPE evaluation for reinforcement learning [17, 18…