Formalising the robustness of counterfactual explanations for neural networks

J Jiang, F Leofante, A Rago, F Toni - … of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
The use of counterfactual explanations (CFXs) is an increasingly popular explanation
strategy for machine learning models. However, recent studies have shown that these …

Convex optimization for actionable\& plausible counterfactual explanations

A Artelt, B Hammer - arXiv preprint arXiv:2105.07630, 2021 - arxiv.org
Transparency is an essential requirement of machine learning based decision making
systems that are deployed in real world. Often, transparency of a given system is achieved …

Complementary reinforcement learning towards explainable agents

JH Lee - arXiv preprint arXiv:1901.00188, 2019 - arxiv.org
Reinforcement learning (RL) algorithms allow agents to learn skills and strategies to perform
complex tasks without detailed instructions or expensive labelled training examples. That is …

ECINN: efficient counterfactuals from invertible neural networks

F Hvilshøj, A Iosifidis, I Assent - arXiv preprint arXiv:2103.13701, 2021 - arxiv.org
Counterfactual examples identify how inputs can be altered to change the predicted class of
a classifier, thus opening up the black-box nature of, eg, deep neural networks. We propose …

Uncertainty-aware action advising for deep reinforcement learning agents

FL Da Silva, P Hernandez-Leal, B Kartal… - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Abstract Although Reinforcement Learning (RL) has been one of the most successful
approaches for learning in sequential decision making problems, the sample-complexity of …

Counterfactual explanations for machine learning: Challenges revisited

S Verma, J Dickerson, K Hines - arXiv preprint arXiv:2106.07756, 2021 - arxiv.org
Counterfactual explanations (CFEs) are an emerging technique under the umbrella of
interpretability of machine learning (ML) models. They provide``what if''feedback of the …

[PDF][PDF] Distilling deep reinforcement learning policies in soft decision trees

Y Coppens, K Efthymiadis, T Lenaerts… - Proceedings of the …, 2019 - dipot.ulb.ac.be
An important step in Reinforcement Learning (RL) research is to create mechanisms that
give higher level insights into the black-box policy models used nowadays and provide …

Consistent counterfactuals for deep models

E Black, Z Wang, M Fredrikson, A Datta - arXiv preprint arXiv:2110.03109, 2021 - arxiv.org
Counterfactual examples are one of the most commonly-cited methods for explaining the
predictions of machine learning models in key areas such as finance and medical diagnosis …

Scout: Self-aware discriminant counterfactual explanations

P Wang, N Vasconcelos - … of the IEEE/CVF Conference on …, 2020 - openaccess.thecvf.com
The problem of counterfactual visual explanations is considered. A new family of
discriminant explanations is introduced. These produce heatmaps that attribute high scores …

Causal reasoning from meta-reinforcement learning

I Dasgupta, J Wang, S Chiappa, J Mitrovic… - arXiv preprint arXiv …, 2019 - arxiv.org
Discovering and exploiting the causal structure in the environment is a crucial challenge for
intelligent agents. Here we explore whether causal reasoning can emerge via meta …