J Zhu, Y Wei, Y Kang, X Jiang, GE Dullerud - Science China Information …, 2022 - Springer
Deep reinforcement learning (DRL) is currently used to solve Markov decision process
problems for which the environment is typically assumed to be stationary. In this paper, we …