K Lin, D Li, Y Li, S Chen, Q Liu, J Gao… - IEEE Transactions on …, 2023 - europepmc.org
Reinforcement learning (RL) still suffers from the problem of sample inefficiency and
struggles with the exploration issue, particularly in situations with long-delayed rewards …