相关文章- 学术资源搜索

Post-hoc explanations fail to achieve their purpose in adversarial contexts

S Bordt, M Finck, E Raidl, U von Luxburg - Proceedings of the 2022 ACM …, 2022 - dl.acm.org

Existing and planned legislation stipulates various obligations to provide information about
machine learning algorithms and their functioning, often interpreted as obligations to …

被引用次数：64 相关文章所有 8 个版本

[PDF] acm.org

The road to explainability is paved with bias: Measuring the fairness of explanations

A Balagopalan, H Zhang, K Hamidieh… - Proceedings of the …, 2022 - dl.acm.org

Machine learning models in safety-critical settings like healthcare are often “blackboxes”:
they contain a large number of parameters which are not transparent to users. Post-hoc …

被引用次数：72 相关文章所有 8 个版本

[PDF] acm.org

Explaining machine learning classifiers through diverse counterfactual explanations

RK Mothilal, A Sharma, C Tan - Proceedings of the 2020 conference on …, 2020 - dl.acm.org

Post-hoc explanations of machine learning models are crucial for people to understand and
act on algorithmic predictions. An intriguing class of explanations is through counterfactuals …

被引用次数：1077 相关文章所有 6 个版本

[PDF] acm.org

Fooling lime and shap: Adversarial attacks on post hoc explanation methods

D Slack, S Hilgard, E Jia, S Singh… - Proceedings of the AAAI …, 2020 - dl.acm.org

As machine learning black boxes are increasingly being deployed in domains such as
healthcare and criminal justice, there is growing emphasis on building tools and techniques …

被引用次数：864 相关文章所有 9 个版本

[PDF] github.io

[PDF][PDF] SoK: Explainable machine learning in adversarial environments

M Noppel, C Wressnegger - 2024 IEEE Symposium on …, 2023 - oaklandsok.github.io

Modern deep learning methods have long been considered black boxes due to the lack of
insights into their decision-making process. However, recent advances in explainable …

被引用次数：6 相关文章所有 8 个版本

[PDF] springer.com

Varieties of AI explanations under the law. From the GDPR to the AIA, and beyond

P Hacker, JH Passoth - … workshop on extending explainable AI beyond …, 2020 - Springer

The quest to explain the output of artificial intelligence systems has clearly moved from a
mere technical to a highly legally and politically relevant endeavor. In this paper, we provide …

被引用次数：60 相关文章所有 3 个版本

[PDF] acm.org

Fairness via explanation quality: Evaluating disparities in the quality of post hoc explanations

J Dai, S Upadhyay, U Aivodji, SH Bach… - Proceedings of the 2022 …, 2022 - dl.acm.org

As post hoc explanation methods are increasingly being leveraged to explain complex
models in high-stakes settings, it becomes critical to ensure that the quality of the resulting …

被引用次数：49 相关文章所有 6 个版本

[PDF] arxiv.org

Can I trust the explainer? Verifying post-hoc explanatory methods

OM Camburu, E Giunchiglia, J Foerster… - arXiv preprint arXiv …, 2019 - arxiv.org

For AI systems to garner widespread public acceptance, we must develop methods capable
of explaining the decisions of black-box models such as neural networks. In this work, we …

被引用次数：56 相关文章所有 5 个版本

[PDF] pitt.edu

Robustness in machine learning explanations: does it matter?

L Hancox-Li - Proceedings of the 2020 conference on fairness …, 2020 - dl.acm.org

The explainable AI literature contains multiple notions of what an explanation is and what
desiderata explanations should satisfy. One implicit source of disagreement is how far the …

被引用次数：108 相关文章所有 6 个版本

[PDF] arxiv.org

Adversarial attacks and defenses in explainable artificial intelligence: A survey

H Baniecki, P Biecek - Information Fusion, 2024 - Elsevier

Explainable artificial intelligence (XAI) methods are portrayed as a remedy for debugging
and trusting statistical and deep learning models, as well as interpreting their predictions …

被引用次数：23 相关文章所有 5 个版本

高级搜索

QQ 群

Post-hoc explanations fail to achieve their purpose in adversarial contexts

The road to explainability is paved with bias: Measuring the fairness of explanations

Explaining machine learning classifiers through diverse counterfactual explanations

Fooling lime and shap: Adversarial attacks on post hoc explanation methods

[PDF][PDF] SoK: Explainable machine learning in adversarial environments

Varieties of AI explanations under the law. From the GDPR to the AIA, and beyond

Fairness via explanation quality: Evaluating disparities in the quality of post hoc explanations

Can I trust the explainer? Verifying post-hoc explanatory methods

Robustness in machine learning explanations: does it matter?

Adversarial attacks and defenses in explainable artificial intelligence: A survey

相关搜索

引用