G Jia, J Huo, F Yang, B Yang - Information Processing & Management, 2024 - Elsevier
The trade-off between exploration and exploitation has been one of the main challenges for
ensuring sampling efficiency, solution optimality, and transferability of reinforcement learning …