L Wang, Y Zhang, Y Hu, W Wang… - International …, 2022 - proceedings.mlr.press
In many real-world multi-agent systems, the sparsity of team rewards often makes it difficult
for an algorithm to successfully learn a cooperative team policy. At present, the common way …