Post-hoc explanations fail to achieve their purpose in adversarial contexts

S Bordt, M Finck, E Raidl, U von Luxburg - Proceedings of the 2022 ACM …, 2022 - dl.acm.org
Existing and planned legislation stipulates various obligations to provide information about
machine learning algorithms and their functioning, often interpreted as obligations to …

The road to explainability is paved with bias: Measuring the fairness of explanations

A Balagopalan, H Zhang, K Hamidieh… - Proceedings of the …, 2022 - dl.acm.org
Machine learning models in safety-critical settings like healthcare are often “blackboxes”:
they contain a large number of parameters which are not transparent to users. Post-hoc …

Explaining machine learning classifiers through diverse counterfactual explanations

RK Mothilal, A Sharma, C Tan - Proceedings of the 2020 conference on …, 2020 - dl.acm.org
Post-hoc explanations of machine learning models are crucial for people to understand and
act on algorithmic predictions. An intriguing class of explanations is through counterfactuals …

Fooling lime and shap: Adversarial attacks on post hoc explanation methods

D Slack, S Hilgard, E Jia, S Singh… - Proceedings of the AAAI …, 2020 - dl.acm.org
As machine learning black boxes are increasingly being deployed in domains such as
healthcare and criminal justice, there is growing emphasis on building tools and techniques …

[PDF][PDF] SoK: Explainable machine learning in adversarial environments

M Noppel, C Wressnegger - 2024 IEEE Symposium on …, 2023 - oaklandsok.github.io
Modern deep learning methods have long been considered black boxes due to the lack of
insights into their decision-making process. However, recent advances in explainable …

Varieties of AI explanations under the law. From the GDPR to the AIA, and beyond

P Hacker, JH Passoth - … workshop on extending explainable AI beyond …, 2020 - Springer
The quest to explain the output of artificial intelligence systems has clearly moved from a
mere technical to a highly legally and politically relevant endeavor. In this paper, we provide …

Fairness via explanation quality: Evaluating disparities in the quality of post hoc explanations

J Dai, S Upadhyay, U Aivodji, SH Bach… - Proceedings of the 2022 …, 2022 - dl.acm.org
As post hoc explanation methods are increasingly being leveraged to explain complex
models in high-stakes settings, it becomes critical to ensure that the quality of the resulting …

Can I trust the explainer? Verifying post-hoc explanatory methods

OM Camburu, E Giunchiglia, J Foerster… - arXiv preprint arXiv …, 2019 - arxiv.org
For AI systems to garner widespread public acceptance, we must develop methods capable
of explaining the decisions of black-box models such as neural networks. In this work, we …

Robustness in machine learning explanations: does it matter?

L Hancox-Li - Proceedings of the 2020 conference on fairness …, 2020 - dl.acm.org
The explainable AI literature contains multiple notions of what an explanation is and what
desiderata explanations should satisfy. One implicit source of disagreement is how far the …

Adversarial attacks and defenses in explainable artificial intelligence: A survey

H Baniecki, P Biecek - Information Fusion, 2024 - Elsevier
Explainable artificial intelligence (XAI) methods are portrayed as a remedy for debugging
and trusting statistical and deep learning models, as well as interpreting their predictions …