H Shi, W Gao, X Jiang, C Su, P Li - International Journal of Control, 2023 - Taylor & Francis
A novel two-dimensional (2D) off-policy interleaved Q-learning algorithm is proposed to
handle the optimal tracking control problem without prior knowledge of nonlinear batch …