Multi-step reinforcement learning: A unifying algorithm

K De Asis, J Hernandez-Garcia, G Holland… - Proceedings of the AAAI …, 2018 - ojs.aaai.org
… We refer to algorithms which make use of a multi-step atomic backup as atomic multi-step
atomic multi-step algorithm is characterized by its n-step return. For atomic multi-step Sarsa, …

Multi-step greedy reinforcement learning algorithms

M Tomar, Y Efroni… - … on Machine Learning, 2020 - proceedings.mlr.press
… We also establish a connection between our multi-step … Finally, we show the advantage
of using our multi-step greedy … tal to performance, our multi-step greedy algorithms indeed …

Understanding multi-step deep reinforcement learning: A systematic study of the DQN target

JF Hernandez-Garcia, RS Sutton - arXiv preprint arXiv:1901.07510, 2019 - arxiv.org
reinforcement learning, direct comparisons of multi-stepreinforcement learning literature.
The main reason significance has taken such a secondary role in deep reinforcement learning

Fully convolutional network with multi-step reinforcement learning for image processing

R Furuta, N Inoue, T Yamasaki - Proceedings of the AAAI conference on …, 2019 - ojs.aaai.org
This paper tackles a new problem setting: reinforcement learning with pixel-wise rewards (pixelRL)
for image processing. After the introduction of the deep Q-network, deep RL has …

Multi-step reinforcement learning for single image super-resolution

K Vassilo, C Heatwole, T Taha… - Proceedings of the …, 2020 - openaccess.thecvf.com
… This research applies a multi-agent Reinforcement Learning (RL) algorithm to SISR, creating
an advanced ensemble approach for combining powerful GANs. In our implementation …

Multi-step reinforcement learning for model-free predictive energy management of an electrified off-highway vehicle

Q Zhou, J Li, B Shuai, H Williams, Y He, Z Li, H Xu… - Applied Energy, 2019 - Elsevier
… suggests that multi-step reinforcement learning can achieve optimal model-free predictive
control without extra training of Markov chain models [33]. Multi-step reinforcement learning is …

A novel multi-step Q-learning method to improve data efficiency for deep reinforcement learning

Y Yuan, ZL Yu, Z Gu, Y Yeboah, W Wei, X Deng… - Knowledge-Based …, 2019 - Elsevier
… The experimental results demonstrate that the proposed multi-step methods greatly improve
… The new multi-step method contains some of the best features of traditional multi-step and …

The effect of multi-step methods on overestimation in deep reinforcement learning

L Meng, R Gorbet, D Kulić - 2020 25th International Conference …, 2021 - ieeexplore.ieee.org
learning speed. We also discuss the advantages and disadvantages of different ways to do
multi-step … timation and underestimation that underlies offline multi-step methods. Finally, we …

Speeding-up reinforcement learning with multi-step actions

R Schoknecht, M Riedmiller - … Conference Madrid, Spain, August 28–30 …, 2002 - Springer
… We extended Q-learning by integrating the concept of multi-step actions (MSAs) together
with the intra-MSA method. The new MSA-Q-learning algorithm efficiently uses training …

A multi-step reinforcement learning algorithm

ZC Zhang, KS Hu, HY Huang, S Li… - Applied Mechanics and …, 2011 - Trans Tech Publ
… Similarly to the construction of Sarsa(λ,k), we could construct the multi-step variants of
other RL algorithms. For example, Q(λ,k) could be proposed based on Q-learning. …