Theoretical and empirical analysis of reward shaping in reinforcement learning

M Grzes, D Kudenko - 2009 International Conference on …, 2009 - ieeexplore.ieee.org
2009 International Conference on Machine Learning and Applications, 2009ieeexplore.ieee.org
Reinforcement learning suffers scalability problems due to the state space explosion and the
temporal credit assignment problem. Knowledge-based approaches have received a
significant attention in the area. Reward shaping is a particular approach to incorporate
domain knowledge into reinforcement learning. Theoretical and empirical analysis of this
paper reveals important properties of this principle, especially the influence of the reward
type, MDP discount factor, and the way of evaluating the potential function on the …
Reinforcement learning suffers scalability problems due to the state space explosion and the temporal credit assignment problem. Knowledge-based approaches have received a significant attention in the area. Reward shaping is a particular approach to incorporate domain knowledge into reinforcement learning. Theoretical and empirical analysis of this paper reveals important properties of this principle, especially the influence of the reward type, MDP discount factor, and the way of evaluating the potential function on the performance.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果