Explainable reinforcement learning: A survey and comparative review

S Milani, N Topin, M Veloso, F Fang - ACM Computing Surveys, 2024 - dl.acm.org
Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine
learning that has attracted considerable attention in recent years. The goal of XRL is to …

On the role of knowledge graphs in explainable AI

F Lecue - Semantic Web, 2020 - content.iospress.com
The current hype of Artificial Intelligence (AI) mostly refers to the success of machine
learning and its sub-domain of deep learning. However, AI is also about other areas, such …

Reinforcement learning interpretation methods: A survey

A Alharin, TN Doan, M Sartipi - IEEE Access, 2020 - ieeexplore.ieee.org
Reinforcement Learning (RL) systems achieved outstanding performance in different
domains such as Atari games, finance, healthcare, and self-driving cars. However, their …

Explainable reinforcement learning for broad-xai: a conceptual framework and survey

R Dazeley, P Vamplew, F Cruz - Neural Computing and Applications, 2023 - Springer
Broad-XAI moves away from interpreting individual decisions based on a single datum and
aims to provide integrated explanations from multiple machine learning algorithms into a …

The need for verbal robot explanations and how people would like a robot to explain itself

Z Han, E Phillips, HA Yanco - ACM Transactions on Human-Robot …, 2021 - dl.acm.org
Although non-verbal cues such as arm movement and eye gaze can convey robot intention,
they alone may not provide enough information for a human to fully understand a robot's …

[HTML][HTML] Exploring computational user models for agent policy summarization

I Lage, D Lifschitz, F Doshi-Velez… - IJCAI: proceedings of the …, 2019 - ncbi.nlm.nih.gov
AI agents support high stakes decision-making processes from driving cars to prescribing
drugs, making it increasingly important for human users to understand their behavior. Policy …

[HTML][HTML] Local and global explanations of agent behavior: Integrating strategy summaries with saliency maps

T Huber, K Weitz, E André, O Amir - Artificial Intelligence, 2021 - Elsevier
With advances in reinforcement learning (RL), agents are now being developed in high-
stakes application domains such as healthcare and transportation. Explaining the behavior …

An empirical study of reward explanations with human-robot interaction applications

L Sanneman, JA Shah - IEEE Robotics and Automation Letters, 2022 - ieeexplore.ieee.org
Explainable AI techniques that describe agent reward functions can enhance human-robot
collaboration in a variety of settings. However, in order to effectively explain reward …

TEAMSTER: Model-based reinforcement learning for ad hoc teamwork

JG Ribeiro, G Rodrigues, A Sardinha, FS Melo - Artificial Intelligence, 2023 - Elsevier
This paper investigates the use of model-based reinforcement learning in the context of ad
hoc teamwork. We introduce a novel approach, named TEAMSTER, where we propose …

Transparent value alignment

L Sanneman, J Shah - Companion of the 2023 ACM/IEEE International …, 2023 - dl.acm.org
As robots become increasingly prevalent in our communities, aligning the values motivating
their behavior with human values is critical. However, it is often difficult or impossible for …