O Marom, B Rosman - Proceedings of the AAAI Conference on Artificial …, 2018 - cir.nii.ac.jp
抄録< jats: p> A key challenge in many reinforcement learning problems is delayed rewards,
which can significantly slow down learning. Although reward shaping has previously been …