Challenges of real-world reinforcement learning

G Dulac-Arnold, D Mankowitz, T Hester - arXiv preprint arXiv:1904.12901, 2019 - arxiv.org
learning algorithm needs to learn quickly from potential mistakes without needing to repeat
them multiple times before fixing them. Thus, learning on a real system requires an algorithm

When to use parametric models in reinforcement learning?

HP Van Hasselt, M Hessel… - Advances in Neural …, 2019 - proceedings.neurips.cc
… Although off-policy learning algorithms may be part of the long-term answer, we do not
yet have a definitive solution. To quote Sutton and Barto [2018]: The potential for off-policy …

Model-based reinforcement learning for atari

L Kaiser, M Babaeizadeh, P Milos, B Osinski… - arXiv preprint arXiv …, 2019 - arxiv.org
… However, some of the best model-free reinforcement learning algorithms require tens or
hundreds of millions of time steps – the equivalent of several weeks of training in real time. How …

Emphatic algorithms for deep reinforcement learning

R Jiang, T Zahavy, Z Xu, A White… - … Machine Learning, 2021 - proceedings.mlr.press
reinforcement learning agents. We show that naively adapting ETD(λ) to popular deep
reinforcement learning algorithms, … emphatic algorithms for use in the context of such algorithms, …

Multi‐robot path planning based on a deep reinforcement learning DQN algorithm

Y Yang, L Juntao, P Lingling - CAAI Transactions on …, 2020 - Wiley Online Library
… However, the optimal performance of the scheduling system algorithm has high … (DQN)
algorithm in a deep reinforcement learning algorithm, which combines the Q-learning algorithm, …

Mobile robot path planning based on improved DDPG reinforcement learning algorithm

Y Dong, X Zou - 2020 IEEE 11th International Conference on …, 2020 - ieeexplore.ieee.org
… robot path planning based on the improved DDPG reinforcement learning algorithm. The
system uses the improved DDPG reinforcement learning algorithm. The content of this paper is …

Off-policy deep reinforcement learning without exploration

S Fujimoto, D Meger, D Precup - … on machine learning, 2019 - proceedings.mlr.press
… We present the first continuous control deep reinforcement learning algorithm which can
learn effectively from arbitrary, fixed batch data, and empirically demonstrate the quality of its …

Deep reinforcement learning assisted federated learning algorithm for data management of IIoT

P Zhang, C Wang, C Jiang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
… costs, we apply deep reinforcement learning (DRL) to IIoT … Therefore, we propose a FL
algorithm assisted by DRL, which … corroborates the effectiveness of the proposed algorithm. …

An application of deep reinforcement learning to algorithmic trading

T Théate, D Ernst - Expert Systems with Applications, 2021 - Elsevier
… Afterwards, Section 3 introduces and rigorously formalises the particular algorithmic trading
problem considered. Additionally, this section makes the link with the reinforcement learning

Reinforcement learning in robotic applications: a comprehensive survey

B Singh, R Kumar, VP Singh - Artificial Intelligence Review, 2022 - Springer
… In this paper, a brief overview of the application of reinforcement algorithms in robotic
science is presented. This survey offered a comprehensive review based on segments as (1) …