ZC Zhang, KS Hu, HY Huang, S Li… - Applied Mechanics and …, 2011 - Trans Tech Publ
… Similarly to the construction of Sarsa(λ,k), we could construct the multi-step variants of
other RL algorithms. For example, Q(λ,k) could be proposed based on Q-learning. …