Statemask: Explaining deep reinforcement learning through state mask

Z Cheng, X Wu, J Yu, W Sun… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite the promising performance of deep reinforcement learning (DRL) agents in many
challenging scenarios, the black-box nature of these agents greatly limits their applications …

Edge: Explaining deep reinforcement learning policies

W Guo, X Wu, U Khan, X Xing - Advances in Neural …, 2021 - proceedings.neurips.cc
With the rapid development of deep reinforcement learning (DRL) techniques, there is an
increasing need to understand and interpret DRL policies. While recent research has …

Cdt: Cascading decision trees for explainable reinforcement learning

Z Ding, P Hernandez-Leal, GW Ding, C Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Deep Reinforcement Learning (DRL) has recently achieved significant advances in various
domains. However, explaining the policy of RL agents still remains an open problem due to …

Leveraging reward consistency for interpretable feature discovery in reinforcement learning

Q Yang, H Wang, M Tong, W Shi… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The black-box nature of deep reinforcement learning (RL) hinders them from real-world
applications. Therefore, interpreting and explaining RL agents have been active research …

Self-supervised discovering of interpretable features for reinforcement learning

W Shi, G Huang, S Song, Z Wang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Deep reinforcement learning (RL) has recently led to many breakthroughs on a range of
complex control tasks. However, the agent's decision-making process is generally not …

Counterfactual state explanations for reinforcement learning agents via generative deep learning

ML Olson, R Khanna, L Neal, F Li, WK Wong - Artificial Intelligence, 2021 - Elsevier
Counterfactual explanations, which deal with “why not?” scenarios, can provide insightful
explanations to an AI agent's behavior [Miller [38]]. In this work, we focus on generating …

Reccover: Detecting causal confusion for explainable reinforcement learning

J Gajcin, I Dusparic - … , Transparent Autonomous Agents and Multi-Agent …, 2022 - Springer
Despite notable results in various fields over the recent years, deep reinforcement learning
(DRL) algorithms lack transparency, affecting user trust and hindering deployment to high …

Interpretable model-based hierarchical reinforcement learning using inductive logic programming

D Xu, F Fekri - arXiv preprint arXiv:2106.11417, 2021 - arxiv.org
Recently deep reinforcement learning has achieved tremendous success in wide ranges of
applications. However, it notoriously lacks data-efficiency and interpretability. Data-efficiency …

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arXiv preprint arXiv:2211.06665, 2022 - arxiv.org
Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

Ganterfactual-rl: Understanding reinforcement learning agents' strategies through visual counterfactual explanations

T Huber, M Demmler, S Mertes, ML Olson… - arXiv preprint arXiv …, 2023 - arxiv.org
Counterfactual explanations are a common tool to explain artificial intelligence models. For
Reinforcement Learning (RL) agents, they answer" Why not?" or" What if?" questions by …