multi-step reinforcement learning- 学术资源搜索

Multi-step reinforcement learning: A unifying algorithm

K De Asis, J Hernandez-Garcia, G Holland… - Proceedings of the AAAI …, 2018 - ojs.aaai.org

… We refer to algorithms which make use of a multi-step atomic backup as atomic multi-step …
atomic multi-step algorithm is characterized by its n-step return. For atomic multi-step Sarsa, …

被引用次数：138 相关文章所有 7 个版本

[PDF] mlr.press

Multi-step greedy reinforcement learning algorithms

M Tomar, Y Efroni… - … on Machine Learning, 2020 - proceedings.mlr.press

… We also establish a connection between our multi-step … Finally, we show the advantage
of using our multi-step greedy … tal to performance, our multi-step greedy algorithms indeed …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

Understanding multi-step deep reinforcement learning: A systematic study of the DQN target

JF Hernandez-Garcia, RS Sutton - arXiv preprint arXiv:1901.07510, 2019 - arxiv.org

… reinforcement learning, direct comparisons of multi-step … reinforcement learning literature.
The main reason significance has taken such a secondary role in deep reinforcement learning …

被引用次数：75 相关文章所有 4 个版本

[PDF] aaai.org

Fully convolutional network with multi-step reinforcement learning for image processing

R Furuta, N Inoue, T Yamasaki - Proceedings of the AAAI conference on …, 2019 - ojs.aaai.org

This paper tackles a new problem setting: reinforcement learning with pixel-wise rewards (pixelRL)
for image processing. After the introduction of the deep Q-network, deep RL has …

被引用次数：62 相关文章所有 10 个版本

[PDF] thecvf.com

Multi-step reinforcement learning for single image super-resolution

K Vassilo, C Heatwole, T Taha… - Proceedings of the …, 2020 - openaccess.thecvf.com

… This research applies a multi-agent Reinforcement Learning (RL) algorithm to SISR, creating
an advanced ensemble approach for combining powerful GANs. In our implementation …

被引用次数：14 相关文章所有 6 个版本

[PDF] bham.ac.uk

Multi-step reinforcement learning for model-free predictive energy management of an electrified off-highway vehicle

Q Zhou, J Li, B Shuai, H Williams, Y He, Z Li, H Xu… - Applied Energy, 2019 - Elsevier

… suggests that multi-step reinforcement learning can achieve optimal model-free predictive
control without extra training of Markov chain models [33]. Multi-step reinforcement learning is …

被引用次数：139 相关文章所有 11 个版本

A novel multi-step Q-learning method to improve data efficiency for deep reinforcement learning

Y Yuan, ZL Yu, Z Gu, Y Yeboah, W Wei, X Deng… - Knowledge-Based …, 2019 - Elsevier

… The experimental results demonstrate that the proposed multi-step methods greatly improve
… The new multi-step method contains some of the best features of traditional multi-step and …

被引用次数：48 相关文章所有 2 个版本

[PDF] arxiv.org

The effect of multi-step methods on overestimation in deep reinforcement learning

L Meng, R Gorbet, D Kulić - 2020 25th International Conference …, 2021 - ieeexplore.ieee.org

… learning speed. We also discuss the advantages and disadvantages of different ways to do
multi-step … timation and underestimation that underlies offline multi-step methods. Finally, we …

被引用次数：36 相关文章所有 7 个版本

Speeding-up reinforcement learning with multi-step actions

R Schoknecht, M Riedmiller - … Conference Madrid, Spain, August 28–30 …, 2002 - Springer

… We extended Q-learning by integrating the concept of multi-step actions (MSAs) together
with the intra-MSA method. The new MSA-Q-learning algorithm efficiently uses training …

被引用次数：20 相关文章所有 7 个版本

A multi-step reinforcement learning algorithm

ZC Zhang, KS Hu, HY Huang, S Li… - Applied Mechanics and …, 2011 - Trans Tech Publ

… Similarly to the construction of Sarsa(λ,k), we could construct the multi-step variants of
other RL algorithms. For example, Q(λ,k) could be proposed based on Q-learning. …

被引用次数：4 相关文章所有 3 个版本

高级搜索

QQ 群