YH Li, F Zhang, Q Hua, XH Zhou - Knowledge-Based Systems, 2024 - Elsevier
112 天前 - … Off-policy reinforcement learning (RL) algorithms are known … The empirical
results in Section 5 demonstrated that … For computational simplicity, we test the SAC with two …