Multi-step greedy reinforcement learning algorithms

M Tomar, Y Efroni… - … on Machine Learning, 2020 - proceedings.mlr.press
… In this section, we empirically analyze the performance of the κ-PI-DQN and κ-VI-DQN
algorithms on the Atari domains: Breakout, SpaceInvaders, Seaquest, Enduro, BeamRider, and …

Crpo: A new approach for safe reinforcement learning with convergence guarantee

T Xu, Y Liang, G Lan - … Conference on Machine Learning, 2021 - proceedings.mlr.press
… finite-time analysis of primal SRL algorithms with global optimality guarantee. Our empirical
… In reinforcement learning, we aim to find an optimal policy that maximizes the expected total …

Behaviour suite for reinforcement learning

I Osband, Y Doron, M Hessel, J Aslanides… - arXiv preprint arXiv …, 2019 - arxiv.org
… By collecting clear, informative and scalable experiments; and providing accessible tools
for reproducible evaluation we hope to facilitate progress in reinforcement learning research. …

Action robust reinforcement learning and applications in continuous control

C Tessler, Y Efroni, S Mannor - … on Machine Learning, 2019 - proceedings.mlr.press
… We empirically analyze the differences between the PR-MDP and NR-MDP approaches,
and demonstrate their ability to produce robust policies under abrupt perturbations and mass …

Machine learning: An introduction to reinforcement learning

SA Fayaz, S Jahangeer Sidiq… - Machine Learning and …, 2022 - Wiley Online Library
… to reinforcement learning models, procedures, techniques, and reinforcement learning … to
the agent-environment interface and how Reinforcement Learning can be used in various daily …

A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

A Pretorius, S Cameron, E Van Biljon… - Advances in neural …, 2020 - proceedings.neurips.cc
… Networked multi-agent reinforcement learning The agents in our networked control system
use reinforcement learning to learn how to map from states to actions. To study the effects of …

Beyond the one-step greedy approach in reinforcement learning

Y Efroni, G Dalal, B Scherrer… - … on Machine Learning, 2018 - proceedings.mlr.press
… In this work, we introduce the first such analysis. Namely, we formulate vari… Reinforcement
Learning algorithms fit well into our unified framework. We thus shed light on their empirical

An Analysis of Model-Based Reinforcement Learning From Abstracted Observations

RAN Starre, M Loog, E Congeduti… - … on Machine Learning …, 2023 - openreview.net
… They do this by finding solutions to a fundamental problem for Reinforcement Learning (RL),
the exploration-exploitation dilemma: when to take actions to obtain more information (…

On efficiency in hierarchical reinforcement learning

Z Wen, D Precup, M Ibrahimi… - Advances in …, 2020 - proceedings.neurips.cc
… 4.2 Hierarchical Reinforcement Learning We consider posterior sampling for hierarchical
reinforcement learning (PSHRL) to be PSRL applied with a particular kind of prior distribution, …

Reinforcement learning with data envelopment analysis and conditional value-at-risk for the capacity expansion problem

CY Lee, YW Chen - IEEE Transactions on Engineering …, 2023 - ieeexplore.ieee.org
… This article proposes a reinforcement learning (RL) framework embedded with data
envelopment analysis (DEA) to generate the optimal policy and guide the productivity improvement. …