Toward practical and plausible counterfactual explanation through latent adjustment in disentangled space

SH Na, WJ Nam, SW Lee - Expert Systems with Applications, 2023 - Elsevier
Extensive research into eXplainable AI (XAI) has raised interest in generating counterfactual
(CF) explanations. In the past, minimizing the perturbation of input was considered a priority …

Explainable reinforcement learning via model transforms

M Finkelstein, L Liu, Y Kolumbus… - Advances in …, 2022 - proceedings.neurips.cc
Understanding emerging behaviors of reinforcement learning (RL) agents may be difficult
since such agents are often trained in complex environments using highly complex decision …

Leveraging reward consistency for interpretable feature discovery in reinforcement learning

Q Yang, H Wang, M Tong, W Shi… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The black-box nature of deep reinforcement learning (RL) hinders them from real-world
applications. Therefore, interpreting and explaining RL agents have been active research …

Counterfactual evaluation for explainable AI

Y Ge, S Liu, Z Li, S Xu, S Geng, Y Li, J Tan… - arXiv preprint arXiv …, 2021 - arxiv.org
While recent years have witnessed the emergence of various explainable methods in
machine learning, to what degree the explanations really represent the reasoning process …

Counterfactual state explanations for reinforcement learning agents via generative deep learning

ML Olson, R Khanna, L Neal, F Li, WK Wong - Artificial Intelligence, 2021 - Elsevier
Counterfactual explanations, which deal with “why not?” scenarios, can provide insightful
explanations to an AI agent's behavior [Miller [38]]. In this work, we focus on generating …

[PDF][PDF] Counterfactual explanations for machine learning: A review

S Verma, J Dickerson, K Hines - arXiv preprint arXiv …, 2020 - ml-retrospectives.github.io
Abstract Machine learning plays a role in many deployed decision systems, often in ways
that are difficult or impossible to understand by human stakeholders. Explaining, in a human …

Density-based reliable and robust explainer for counterfactual explanation

S Zhang, X Chen, S Wen, Z Li - Expert Systems with Applications, 2023 - Elsevier
As an essential post-hoc explanatory method, counterfactual explanation enables people to
understand and react to machine learning models. Works on counterfactual explanation …

[PDF][PDF] Distal explanations for explainable reinforcement learning agents

P Madumal, T Miller, L Sonenberg… - arXiv preprint arXiv …, 2020 - researchgate.net
Causal explanations present an intuitive way to understand the course of events through
causal chains, and are widely accepted in cognitive science as the prominent model …

Discern: Discovering counterfactual explanations using relevance features from neighbourhoods

N Wiratunga, A Wijekoon, I Nkisi-Orji… - 2021 IEEE 33rd …, 2021 - ieeexplore.ieee.org
Counterfactual explanations focus on" actionable knowledge" to help end-users understand
how a machine learning outcome could be changed to a more desirable outcome. For this …

Edge: Explaining deep reinforcement learning policies

W Guo, X Wu, U Khan, X Xing - Advances in Neural …, 2021 - proceedings.neurips.cc
With the rapid development of deep reinforcement learning (DRL) techniques, there is an
increasing need to understand and interpret DRL policies. While recent research has …