Y Jia,
XY Zhou - Journal of Machine Learning Research, 2023 - jmlr.org
We study the continuous-time counterpart of Q-learning for reinforcement learning (RL)
under the entropy-regularized, exploratory diffusion process formulation introduced by Wang …