Model-agnostic and scalable counterfactual explanations via reinforcement learning

RF Samoilescu, A Van Looveren, J Klaise - arXiv preprint arXiv …, 2021 - arxiv.org
Counterfactual instances are a powerful tool to obtain valuable insights into automated
decision processes, describing the necessary minimal changes in the input space to alter …

[PDF][PDF] Relace: Reinforcement learning agent for counterfactual explanations of arbitrary predictive models

Z Chen, F Silvestri, G Tolomei, H Zhu… - arXiv preprint arXiv …, 2021 - researchgate.net
The demand for explainable machine learning (ML) models has been growing rapidly in
recent years. Amongst the methods proposed to associate ML model predictions with …

Conditional generative models for counterfactual explanations

A Van Looveren, J Klaise, G Vacanti… - arXiv preprint arXiv …, 2021 - arxiv.org
Counterfactual instances offer human-interpretable insight into the local behaviour of
machine learning models. We propose a general framework to generate sparse, in …

Exploring the trade-off between plausibility, change intensity and adversarial power in counterfactual explanations using multi-objective optimization

J Del Ser, A Barredo-Arrieta, N Díaz-Rodríguez… - arXiv preprint arXiv …, 2022 - arxiv.org
There is a broad consensus on the importance of deep learning models in tasks involving
complex data. Often, an adequate understanding of these models is required when focusing …

[PDF][PDF] BayCon: Model-agnostic Bayesian Counterfactual Generator.

P Romashov, M Gjoreski, K Sokol, MV Martinez… - IJCAI, 2022 - uc.inf.usi.ch
Generating counterfactuals to discover hypothetical predictive scenarios is the de facto
standard for explaining machine learning models and their predictions. However, building a …

Counterfactual explanations using optimization with constraint learning

D Maragno, TE Röber, I Birbil - arXiv preprint arXiv:2209.10997, 2022 - arxiv.org
To increase the adoption of counterfactual explanations in practice, several criteria that
these should adhere to have been put forward in the literature. We propose counterfactual …

Mace: An efficient model-agnostic framework for counterfactual explanation

W Yang, J Li, C Xiong, SCH Hoi - arXiv preprint arXiv:2205.15540, 2022 - arxiv.org
Counterfactual explanation is an important Explainable AI technique to explain machine
learning predictions. Despite being studied actively, existing optimization-based methods …

[PDF][PDF] CounterNet: End-to-end training of counterfactual aware predictions

H Guo, T Nguyen, A Yadav - ICML Workshop on Algorithmic …, 2021 - amulyayadav.com
This work presents CounterNet, a novel endto-end learning framework which integrates the
predictive model training and the counterfactual (CF) explanation into a single end-to-end …

Explain the explainer: Interpreting model-agnostic counterfactual explanations of a deep reinforcement learning agent

Z Chen, F Silvestri, G Tolomei, J Wang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Counterfactual examples (CFs) are one of the most popular methods for attaching post hoc
explanations to machine learning models. However, existing CF generation methods either …

Are Data-driven Explanations Robust against Out-of-distribution Data?

T Li, F Qiao, M Ma, X Peng - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
As black-box models increasingly power high-stakes applications, a variety of data-driven
explanation methods have been introduced. Meanwhile, machine learning models are …