H Yang, S Li, X Xu, X Liu, Z Meng, Y Zhang - IEEE Access, 2021 - ieeexplore.ieee.org
Pommerman is a popular reinforcement learning environment because it poses several
challenges, such as sparse and deceptive rewards and delayed action effects. In this paper …