A hierarchical Bayesian approach to inverse reinforcement learning with symbolic reward machines- 学术资源搜索

A hierarchical Bayesian approach to inverse reinforcement learning with symbolic reward machines

W Zhou, W Li - International Conference on Machine …, 2022 - proceedings.mlr.press

International Conference on Machine Learning, 2022•proceedings.mlr.press

Abstract

A misspecified reward can degrade sample efficiency and induce undesired behaviors in reinforcement learning (RL) problems. We propose symbolic reward machines for incorporating high-level task knowledge when specifying the reward signals. Symbolic reward machines augment existing reward machine formalism by allowing transitions to carry predicates and symbolic reward outputs. This formalism lends itself well to inverse reinforcement learning, whereby the key challenge is determining appropriate assignments to the symbolic values from a few expert demonstrations. We propose a hierarchical Bayesian approach for inferring the most likely assignments such that the concretized reward machine can discriminate expert demonstrated trajectories from other trajectories with high accuracy. Experimental results show that learned reward machines can significantly improve training efficiency for complex RL tasks and generalize well across different task environment configurations.

proceedings.mlr.press

展开收起

被引用次数：3 相关文章所有 4 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

A hierarchical Bayesian approach to inverse reinforcement learning with symbolic reward machines

引用