Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities

J Gajcin, I Dusparic - ACM Computing Surveys, 2024 - dl.acm.org
counterfactual explanations in supervised learning. Additionally, we explore the diferences
between counterfactual explanations in supervised learning … in reinforcement learning. Finally…

Model-agnostic and scalable counterfactual explanations via reinforcement learning

RF Samoilescu, A Van Looveren, J Klaise - arXiv preprint arXiv …, 2021 - arxiv.org
learning models (eg differentiable or tree-based models). In this work, we propose a deep
reinforcement learning … us to generate batches of counterfactual instances in a single forward …

Counterfactual state explanations for reinforcement learning agents via generative deep learning

ML Olson, R Khanna, L Neal, F Li, WK Wong - Artificial Intelligence, 2021 - Elsevier
explanations to an AI agent's behavior [Miller [38]]. In this work, we focus on generating
counterfactual explanations for deep reinforcement learning (… counterfactual state explanations, a …

[PDF][PDF] Relace: Reinforcement learning agent for counterfactual explanations of arbitrary predictive models

Z Chen, F Silvestri, G Tolomei, H Zhu… - arXiv preprint arXiv …, 2021 - researchgate.net
… The latter aims at generating ex post explanations for … post-hoc explanations called
counterfactual explanations, which … from the predictive model, ie, counterfactual examples (CFs). …

Explain the explainer: Interpreting model-agnostic counterfactual explanations of a deep reinforcement learning agent

Z Chen, F Silvestri, G Tolomei, J Wang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
… RELAX – the counterfactual explanation technique based on a deep reinforcement learning
… Our method is modelagnostic: it can generate valid explanations for any model that are …

Explainable reinforcement learning through a causal lens

P Madumal, T Miller, L Sonenberg, F Vetere - Proceedings of the AAAI …, 2020 - aaai.org
counterfactuals to be explained; something that is known to be a powerful explanation mech…
Our model generates counterfactual explanations by comparing causal chains of the actual …

Ganterfactual-rl: Understanding reinforcement learning agents' strategies through visual counterfactual explanations

T Huber, M Demmler, S Mertes, ML Olson… - arXiv preprint arXiv …, 2023 - arxiv.org
… whether counterfactual explanationscounterfactual explanations for RL agents by
formulating the problem as a domain transfer problem which allows the use of adversarial learning

Exploratory not explanatory: Counterfactual analysis of saliency maps for deep reinforcement learning

A Atrey, K Clary, D Jensen - arXiv preprint arXiv:1912.05743, 2019 - arxiv.org
explanations of the behavior of deep reinforcement learning (… indicates that the derived
explanations are often unfalsifiable … approach grounded in counterfactual reasoning to test the …

Raccer: Towards reachable and certain counterfactual explanations for reinforcement learning

J Gajcin, I Dusparic - arXiv preprint arXiv:2303.04475, 2023 - arxiv.org
… In this section, we describe RACCER, our approach for generating counterfactual explanations
for RL tasks. To generate a counterfactual explanation x′, we require oracle access to …

Explaining reinforcement learning agents through counterfactual action outcomes

Y Amitai, Y Septon, O Amir - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
… Counterfactual explanations are a form of contrastive explanations. A counterfactual
explanation provides an answer to the question “Why p rather than q?”, where p is the fact which oc…