reinforcement learning algorithms- 学术资源搜索

Challenges of real-world reinforcement learning

G Dulac-Arnold, D Mankowitz, T Hester - arXiv preprint arXiv:1904.12901, 2019 - arxiv.org

… learning algorithm needs to learn quickly from potential mistakes without needing to repeat
them multiple times before fixing them. Thus, learning on a real system requires an algorithm …

被引用次数：633 相关文章所有 4 个版本

[PDF] neurips.cc

When to use parametric models in reinforcement learning?

HP Van Hasselt, M Hessel… - Advances in Neural …, 2019 - proceedings.neurips.cc

… Although off-policy learning algorithms may be part of the long-term answer, we do not
yet have a definitive solution. To quote Sutton and Barto [2018]: The potential for off-policy …

被引用次数：207 相关文章所有 8 个版本

[PDF] arxiv.org

Model-based reinforcement learning for atari

L Kaiser, M Babaeizadeh, P Milos, B Osinski… - arXiv preprint arXiv …, 2019 - arxiv.org

… However, some of the best model-free reinforcement learning algorithms require tens or
hundreds of millions of time steps – the equivalent of several weeks of training in real time. How …

被引用次数：915 相关文章所有 6 个版本

[PDF] mlr.press

Emphatic algorithms for deep reinforcement learning

R Jiang, T Zahavy, Z Xu, A White… - … Machine Learning, 2021 - proceedings.mlr.press

… reinforcement learning agents. We show that naively adapting ETD(λ) to popular deep
reinforcement learning algorithms, … emphatic algorithms for use in the context of such algorithms, …

被引用次数：23 相关文章所有 4 个版本

[PDF] wiley.com Full View

Multi‐robot path planning based on a deep reinforcement learning DQN algorithm

Y Yang, L Juntao, P Lingling - CAAI Transactions on …, 2020 - Wiley Online Library

… However, the optimal performance of the scheduling system algorithm has high … (DQN)
algorithm in a deep reinforcement learning algorithm, which combines the Q-learning algorithm, …

被引用次数：165 相关文章所有 6 个版本

Mobile robot path planning based on improved DDPG reinforcement learning algorithm

Y Dong, X Zou - 2020 IEEE 11th International Conference on …, 2020 - ieeexplore.ieee.org

… robot path planning based on the improved DDPG reinforcement learning algorithm. The
system uses the improved DDPG reinforcement learning algorithm. The content of this paper is …

被引用次数：45 相关文章

[PDF] mlr.press

Off-policy deep reinforcement learning without exploration

S Fujimoto, D Meger, D Precup - … on machine learning, 2019 - proceedings.mlr.press

… We present the first continuous control deep reinforcement learning algorithm which can
learn effectively from arbitrary, fixed batch data, and empirically demonstrate the quality of its …

被引用次数：1498 相关文章所有 9 个版本

[PDF] arxiv.org

Deep reinforcement learning assisted federated learning algorithm for data management of IIoT

P Zhang, C Wang, C Jiang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

… costs, we apply deep reinforcement learning (DRL) to IIoT … Therefore, we propose a FL
algorithm assisted by DRL, which … corroborates the effectiveness of the proposed algorithm. …

被引用次数：152 相关文章所有 4 个版本

[PDF] arxiv.org

An application of deep reinforcement learning to algorithmic trading

T Théate, D Ernst - Expert Systems with Applications, 2021 - Elsevier

… Afterwards, Section 3 introduces and rigorously formalises the particular algorithmic trading
problem considered. Additionally, this section makes the link with the reinforcement learning …

被引用次数：166 相关文章所有 6 个版本

Reinforcement learning in robotic applications: a comprehensive survey

B Singh, R Kumar, VP Singh - Artificial Intelligence Review, 2022 - Springer

… In this paper, a brief overview of the application of reinforcement algorithms in robotic
science is presented. This survey offered a comprehensive review based on segments as (1) …

被引用次数：178 相关文章所有 4 个版本

高级搜索

QQ 群

Challenges of real-world reinforcement learning

When to use parametric models in reinforcement learning?

Model-based reinforcement learning for atari

Emphatic algorithms for deep reinforcement learning

Multi‐robot path planning based on a deep reinforcement learning DQN algorithm

Mobile robot path planning based on improved DDPG reinforcement learning algorithm

Off-policy deep reinforcement learning without exploration

Deep reinforcement learning assisted federated learning algorithm for data management of IIoT

An application of deep reinforcement learning to algorithmic trading

Reinforcement learning in robotic applications: a comprehensive survey

引用