temporal credit assignment problem. Knowledge-based approaches have received a
significant attention in the area. Reward shaping is a particular approach to incorporate
domain knowledge into reinforcement learning. Theoretical and empirical analysis of this
paper reveals important properties of this principle, especially the influence of the reward
type, MDP discount factor, and the way of evaluating the potential function on the …