Explaining reinforcement learning to mere mortals: An empirical study

A Anderson, J Dodge, A Sadarangani… - arXiv preprint arXiv …, 2019 - arxiv.org
We present a user study to investigate the impact of explanations on non-experts'
understanding of reinforcement learning (RL) agents. We investigate both a common RL …

Mental models of mere mortals with explanations of reinforcement learning

A Anderson, J Dodge, A Sadarangani… - ACM Transactions on …, 2020 - dl.acm.org
How should reinforcement learning (RL) agents explain themselves to humans not trained in
AI? To gain insights into this question, we conducted a 124-participant, four-treatment …

[PDF][PDF] Distal explanations for explainable reinforcement learning agents

P Madumal, T Miller, L Sonenberg… - arXiv preprint arXiv …, 2020 - researchgate.net
Causal explanations present an intuitive way to understand the course of events through
causal chains, and are widely accepted in cognitive science as the prominent model …

Explaining reinforcement learning agents through counterfactual action outcomes

Y Amitai, Y Septon, O Amir - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Explainable reinforcement learning (XRL) methods aim to help elucidate agent policies and
decision-making processes. The majority of XRL approaches focus on local explanations …

Human-level reinforcement learning through theory-based modeling, exploration, and planning

PA Tsividis, J Loula, J Burga, N Foss… - arXiv preprint arXiv …, 2021 - arxiv.org
Reinforcement learning (RL) studies how an agent comes to achieve reward in an
environment through interactions over time. Recent advances in machine RL have …

Distal explanations for model-free explainable reinforcement learning

P Madumal, T Miller, L Sonenberg, F Vetere - arXiv preprint arXiv …, 2020 - arxiv.org
In this paper we introduce and evaluate a distal explanation model for model-free
reinforcement learning agents that can generate explanations forwhy'andwhy not'questions …

Integrating policy summaries with reward decomposition for explaining reinforcement learning agents

Y Septon, T Huber, E André, O Amir - … of Agents and Multi-Agent Systems, 2023 - Springer
Explainable reinforcement learning methods can roughly be divided into local explanations
that analyze specific decisions of the agents and global explanations that convey the …

Toward a psychology of deep reinforcement learning agents using a cognitive architecture

K Mitsopoulos, S Somers, J Schooler… - Topics in Cognitive …, 2022 - Wiley Online Library
We argue that cognitive models can provide a common ground between human users and
deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial …

Contrastive explanations for reinforcement learning in terms of expected consequences

J van der Waa, J van Diggelen, K Bosch… - arXiv preprint arXiv …, 2018 - arxiv.org
Machine Learning models become increasingly proficient in complex tasks. However, even
for experts in the field, it can be difficult to understand what the model learned. This hampers …

Ganterfactual-rl: Understanding reinforcement learning agents' strategies through visual counterfactual explanations

T Huber, M Demmler, S Mertes, ML Olson… - arXiv preprint arXiv …, 2023 - arxiv.org
Counterfactual explanations are a common tool to explain artificial intelligence models. For
Reinforcement Learning (RL) agents, they answer" Why not?" or" What if?" questions by …