J Adebayo, M Muelly, H Abelson… - International conference on …, 2022 - openreview.net
We investigate whether three types of post hoc model explanations (feature attribution,
concept activation, and training point ranking) are effective for detecting a model's reliance …