Interpretable deep learning: Interpretation, interpretability, trustworthiness, and beyond

X Li, H Xiong, X Li, X Wu, X Zhang, J Liu, J Bian… - … and Information Systems, 2022 - Springer
Deep neural networks have been well-known for their superb handling of various machine
learning and artificial intelligence tasks. However, due to their over-parameterized black-box …

A survey on neural network interpretability

Y Zhang, P Tiňo, A Leonardis… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Along with the great success of deep neural networks, there is also growing concern about
their black-box nature. The interpretability issue affects people's trust on deep learning …

Towards faithfully interpretable NLP systems: How should we define and evaluate faithfulness?

A Jacovi, Y Goldberg - arXiv preprint arXiv:2004.03685, 2020 - arxiv.org
With the growing popularity of deep-learning based NLP models, comes a need for
interpretable systems. But what is interpretability, and what constitutes a high-quality …

[PDF][PDF] On interpretability of artificial neural networks

F Fan, J Xiong, G Wang - arXiv preprint arXiv:2001.02522, 2020 - researchgate.net
Deep learning has achieved great successes in many important areas to dealing with text,
images, video, graphs, and so on. However, the black-box nature of deep artificial neural …

A comparative study of faithfulness metrics for model interpretability methods

CS Chan, H Kong, G Liang - arXiv preprint arXiv:2204.05514, 2022 - arxiv.org
Interpretation methods to reveal the internal reasoning processes behind machine learning
models have attracted increasing attention in recent years. To quantify the extent to which …

A survey on the interpretability of deep learning in medical diagnosis

Q Teng, Z Liu, Y Song, K Han, Y Lu - Multimedia Systems, 2022 - Springer
Deep learning has demonstrated remarkable performance in the medical domain, with
accuracy that rivals or even exceeds that of human experts. However, it has a significant …

Review study of interpretation methods for future interpretable machine learning

JX Mi, AD Li, LF Zhou - IEEE Access, 2020 - ieeexplore.ieee.org
In recent years, black-box models have developed rapidly because of their high accuracy.
Balancing the interpretability and accuracy is increasingly important. The lack of …

Interpretability of deep learning models: A survey of results

S Chakraborty, R Tomsett… - … , advanced & trusted …, 2017 - ieeexplore.ieee.org
Deep neural networks have achieved near-human accuracy levels in various types of
classification and prediction tasks including images, text, speech, and video data. However …

Explaining the black-box model: A survey of local interpretation methods for deep neural networks

Y Liang, S Li, C Yan, M Li, C Jiang - Neurocomputing, 2021 - Elsevier
Recently, a significant amount of research has been investigated on interpretation of deep
neural networks (DNNs) which are normally processed as black box models. Among the …

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …