Related articles - Academic Resource Search

Exploring counterfactual explanations through the lens of adversarial examples: A theoretical and empirical analysis

M Pawelczyk, C Agarwal, S Joshi… - International …, 2022 - proceedings.mlr.press

As machine learning (ML) models become more widely deployed in high-stakes
applications, counterfactual explanations have emerged as key tools for providing …

Cited by 60 · Related articles · All 6 versions

[PDF] arxiv.org

Distal explanations for model-free explainable reinforcement learning

P Madumal, T Miller, L Sonenberg, F Vetere - arXiv preprint arXiv …, 2020 - arxiv.org

In this paper we introduce and evaluate a distal explanation model for model-free
reinforcement learning agents that can generate explanations for 'why' and 'why not' questions …

Cited by 19 · Related articles · All 2 versions

[PDF] arxiv.org

Counterfactual explanations using optimization with constraint learning

D Maragno, TE Röber, I Birbil - arXiv preprint arXiv:2209.10997, 2022 - arxiv.org

To increase the adoption of counterfactual explanations in practice, several criteria that
these should adhere to have been put forward in the literature. We propose counterfactual …

Cited by 10 · Related articles · All 7 versions

[PDF] aaai.org

Explaining reinforcement learning agents through counterfactual action outcomes

Y Amitai, Y Septon, O Amir - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Explainable reinforcement learning (XRL) methods aim to help elucidate agent policies and
decision-making processes. The majority of XRL approaches focus on local explanations …

Cited by 5 · Related articles · All 3 versions

[PDF] aaai.org

Generation of policy-level explanations for reinforcement learning

N Topin, M Veloso - Proceedings of the AAAI Conference on Artificial …, 2019 - ojs.aaai.org

Though reinforcement learning has greatly benefited from the incorporation of neural
networks, the inability to verify the correctness of such systems limits their use. Current work …

Cited by 82 · Related articles · All 9 versions

[PDF] sciencedirect.com

Explainability in deep reinforcement learning

A Heuillet, F Couthouis, N Díaz-Rodríguez - Knowledge-Based Systems, 2021 - Elsevier

A large set of the explainable Artificial Intelligence (XAI) literature is emerging on feature
relevance techniques to explain a deep neural network (DNN) output or explaining models …

Cited by 344 · Related articles · All 12 versions

[PDF] arxiv.org

Ganterfactual-rl: Understanding reinforcement learning agents' strategies through visual counterfactual explanations

T Huber, M Demmler, S Mertes, ML Olson… - arXiv preprint arXiv …, 2023 - arxiv.org

Counterfactual explanations are a common tool to explain artificial intelligence models. For
Reinforcement Learning (RL) agents, they answer "Why not?" or "What if?" questions by …

Cited by 13 · Related articles · All 6 versions

[PDF] neurips.cc

Designing counterfactual generators using deep model inversion

J Thiagarajan, VS Narayanaswamy… - Advances in …, 2021 - proceedings.neurips.cc

Explanation techniques that synthesize small, interpretable changes to a given image while
producing desired changes in the model prediction have become popular for introspecting …

Cited by 25 · Related articles · All 8 versions

[PDF] psu.edu

A novel policy-graph approach with natural language and counterfactual abstractions for explaining reinforcement learning agents

T Liu, J McCalmon, T Le, MA Rahman, D Lee… - Autonomous Agents and …, 2023 - Springer

As reinforcement learning (RL) continues to improve and be applied in situations alongside
humans, the need to explain the learned behaviors of RL agents to end-users becomes …

Cited by 3 · Related articles · All 4 versions

[PDF] mlr.press

Counterfactual explanation trees: Transparent and consistent actionable recourse with decision trees

K Kanamori, T Takagi… - … Conference on Artificial …, 2022 - proceedings.mlr.press

Counterfactual Explanation (CE) is a post-hoc explanation method that provides a
perturbation for altering the prediction result of a classifier. An individual can interpret the …

Cited by 29 · Related articles · All 4 versions

