D Ye, G Chen, W Zhang, S Chen… - Advances in …, 2020 - proceedings.neurips.cc
MOBA games, eg, Honor of Kings, League of Legends, and Dota 2, pose grand challenges to AI systems such as multi-agent, enormous state-action space, complex action control, etc …
D Ye, G Chen, P Zhao, F Qiu, B Yuan… - … on Neural Networks …, 2020 - ieeexplore.ieee.org
We present JueWu-SL, the first supervised-learning-based artificial intelligence (AI) program that achieves human-level performance in playing multiplayer online battle arena (MOBA) …
C Wang, Y Wu, Q Vuong… - … Conference on Machine …, 2020 - proceedings.mlr.press
We aim to develop off-policy DRL algorithms that not only exceed state-of-the-art performance but are also simple and minimalistic. For standard continuous control …
S Han, W Zhou, S Lü, J Yu - Knowledge-Based Systems, 2021 - Elsevier
Abstract Deep Deterministic Policy Gradient (DDPG) algorithm is one of the most well-known reinforcement learning methods. However, this method is inefficient and unstable in practical …
Policy gradient methods can solve complex tasks but often fail when the dimensionality of the action-space or objective multiplicity grow very large. This occurs, in part, because the …
Y Yang, D Xing, W Xia, P Wang - Machine Intelligence Research, 2025 - Springer
Reinforcement learning encounters formidable challenges when tasked with intricate decision-making scenarios, primarily due to the expansive parameterized action spaces and …
The field of Deep Reinforcement Learning (DRL) has recently seen a surge in the popularity of maximum entropy reinforcement learning algorithms. Their popularity stems from the …
Sample efficiency in deep reinforcement learning (DRL) is measured by the amount of new data collected to learn a task. It is one of the most important research topics in DRL …
Action spaces equipped with parameter sets are a common occurrence in reinforcement learning applications. Solutions to problems of this class have been developed under …