Explainability in deep reinforcement learning: A review into current methods and applications

T Hickling, A Zenati, N Aouf, P Spencer - ACM Computing Surveys, 2023 - dl.acm.org
The use of Deep Reinforcement Learning (DRL) schemes has increased dramatically since
their first introduction in 2015. Though uses in many different applications are being found …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Reinforcement learning interpretation methods: A survey

A Alharin, TN Doan, M Sartipi - IEEE Access, 2020 - ieeexplore.ieee.org
Reinforcement Learning (RL) systems achieved outstanding performance in different
domains such as Atari games, finance, healthcare, and self-driving cars. However, their …

Bridging the human-ai knowledge gap: Concept discovery and transfer in alphazero

L Schut, N Tomasev, T McGrath, D Hassabis… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial Intelligence (AI) systems have made remarkable progress, attaining super-human
performance across various domains. This presents us with an opportunity to further human …

Shielded Reinforcement Learning: A review of reactive methods for safe learning

H Odriozola-Olalde, M Zamalloa… - 2023 IEEE/SICE …, 2023 - ieeexplore.ieee.org
Reinforcement Learning (RL) algorithms are showing promising results in simulated
environments, but their replication in real physical applications, even more so in safety …

[HTML][HTML] Verifying learning-based robotic navigation systems

G Amir, D Corsi, R Yerushalmi, L Marzari… - … Conference on Tools …, 2023 - Springer
Deep reinforcement learning (DRL) has become a dominant deep-learning paradigm for
tasks where complex policies are learned within reactive systems. Unfortunately, these …

[HTML][HTML] Verifying generalization in deep learning

G Amir, O Maayan, T Zelazny, G Katz… - … Conference on Computer …, 2023 - Springer
Deep neural networks (DNNs) are the workhorses of deep learning, which constitutes the
state of the art in numerous application domains. However, DNN-based decision rules are …

Explain the explainer: Interpreting model-agnostic counterfactual explanations of a deep reinforcement learning agent

Z Chen, F Silvestri, G Tolomei, J Wang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Counterfactual examples (CFs) are one of the most popular methods for attaching post hoc
explanations to machine learning models. However, existing CF generation methods either …

Iterative bounding mdps: Learning interpretable policies via non-interpretable methods

N Topin, S Milani, F Fang, M Veloso - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Current work in explainable reinforcement learning generally produces policies in the form
of a decision tree over the state space. Such policies can be used for formal safety …

In Search of netUnicorn: A Data-Collection Platform to Develop Generalizable ML Models for Network Security Problems

R Beltiukov, W Guo, A Gupta, W Willinger - Proceedings of the 2023 …, 2023 - dl.acm.org
The remarkable success of the use of machine learning-based solutions for network security
problems has been impeded by the developed ML models' inability to maintain efficacy …