B Wang, X Li, Y Chen,
J Wu, B Zeng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Value function approximation, such as Q-learning, is widely used in the discrete control
rather than the continuous one because the optimal action in the discrete setting is more …