Explaining deep adaptive programs via reward decomposition

M Erwig, A Fern, M Murali, A Koul - IJCAI/ECAI workshop on explainable …, 2018 - par.nsf.gov
Adaptation Based Programming (ABP) allows programmers to employ" choice points" at
program locations where they are uncertain about how to best code the program logic.
Reinforcement learning (RL) is then used to automatically learn to make choice-point
decisions to optimize the reward achieved by the program. In this paper, we consider a new
approach to explaining the learned decisions of adaptive programs. The key idea is to
include simple program annotations that define multiple semantically meaningful reward …

[引用][C] Explaining deep adaptive programs via reward decomposition. IJCAI

M Erwig, A Fern, M Murali, A Koul - ECAI Workshop on Explainable Artificial …, 2018
以上显示的是最相近的搜索结果。 查看全部搜索结果