Multi-step greedy reinforcement learning algorithms

M Tomar, Y Efroni… - … Conference on Machine …, 2020 - proceedings.mlr.press
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg, in the game of Go) …

Multi-step Greedy Reinforcement Learning Algorithms

M Tomar, Y Efroni, M Ghavamzadeh - arXiv preprint arXiv:1910.02919, 2019 - arxiv.org
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg,~ in the game of Go) …

Multi-step Greedy Reinforcement Learning Algorithms

M Tomar, Y Efroni, M Ghavamzadeh - arXiv e-prints, 2019 - ui.adsabs.harvard.edu
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg,~ in the game of Go) …

Multi-step Greedy Reinforcement Learning Algorithms

M Tomar, Y Efroni… - … Conference on Machine …, 2020 - proceedings.mlr.press
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg, in the game of Go) …

[PDF][PDF] Multi-step Greedy Reinforcement Learning Algorithms

M Tomar, Y Efroni, M Ghavamzadeh - mohammadghavamzadeh.github.io
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg, in the game of Go) …

Multi-step greedy reinforcement learning algorithms

M Tomar, Y Efroni, M Ghavamzadeh - Proceedings of the 37th …, 2020 - dl.acm.org
Multi-step greedy policies have been extensively used in model-based reinforcement
learning (RL), both when a model of the environment is available (eg, in the game of Go) …