L Meng, R Gorbet, D Kulic - 2020 25th International Conference on …, 2021 - computer.org
Multi-step (also called n-step) methods in Reinforcement Learning (RL) have been shown to
be more efficient than the 1-step method due to faster propagation of the reward signal, both …