Explain the explainer: Interpreting model-agnostic counterfactual explanations of a deep reinforcement learning agent

Z Chen, F Silvestri, G Tolomei, J Wang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Counterfactual examples (CFs) are one of the most popular methods for attaching post hoc
explanations to machine learning models. However, existing CF generation methods either …

ReLAX: Reinforcement Learning Agent Explainer for Arbitrary Predictive Models

Z Chen, F Silvestri, J Wang, H Zhu, H Ahn… - Proceedings of the 31st …, 2022 - dl.acm.org
Counterfactual examples (CFs) are one of the most popular methods for attaching post-hoc
explanations to machine learning (ML) models. However, existing CF generation methods …

[PDF][PDF] Relace: Reinforcement learning agent for counterfactual explanations of arbitrary predictive models

Z Chen, F Silvestri, G Tolomei, H Zhu… - arXiv preprint arXiv …, 2021 - researchgate.net
The demand for explainable machine learning (ML) models has been growing rapidly in
recent years. Amongst the methods proposed to associate ML model predictions with …

Model-agnostic and scalable counterfactual explanations via reinforcement learning

RF Samoilescu, A Van Looveren, J Klaise - arXiv preprint arXiv …, 2021 - arxiv.org
Counterfactual instances are a powerful tool to obtain valuable insights into automated
decision processes, describing the necessary minimal changes in the input space to alter …

Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities

J Gajcin, I Dusparic - ACM Computing Surveys, 2024 - dl.acm.org
While AI algorithms have shown remarkable success in various fields, their lack of
transparency hinders their application to real-life tasks. Although explanations targeted at …

Counternet: End-to-end training of prediction aware counterfactual explanations

H Guo, TH Nguyen, A Yadav - Proceedings of the 29th ACM SIGKDD …, 2023 - dl.acm.org
This work presents CounterNet, a novel end-to-end learning framework which integrates
Machine Learning (ML) model training and the generation of corresponding counterfactual …

Ordered counterfactual explanation by mixed-integer linear optimization

K Kanamori, T Takagi, K Kobayashi, Y Ike… - Proceedings of the …, 2021 - ojs.aaai.org
Post-hoc explanation methods for machine learning models have been widely used to
support decision-making. One of the popular methods is Counterfactual Explanation (CE) …

Explainable deep reinforcement learning: state of the art and challenges

GA Vouros - ACM Computing Surveys, 2022 - dl.acm.org
Interpretability, explainability, and transparency are key issues to introducing artificial
intelligence methods in many critical domains. This is important due to ethical concerns and …

Statemask: Explaining deep reinforcement learning through state mask

Z Cheng, X Wu, J Yu, W Sun… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite the promising performance of deep reinforcement learning (DRL) agents in many
challenging scenarios, the black-box nature of these agents greatly limits their applications …

Cdt: Cascading decision trees for explainable reinforcement learning

Z Ding, P Hernandez-Leal, GW Ding, C Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Deep Reinforcement Learning (DRL) has recently achieved significant advances in various
domains. However, explaining the policy of RL agents still remains an open problem due to …