Explainability of artificial intelligence methods, applications and challenges: A comprehensive survey

W Ding, M Abdel-Basset, H Hawash, AM Ali - Information Sciences, 2022 - Elsevier
The continuous advancement of Artificial Intelligence (AI) has been revolutionizing
decision-making across many domains of life. Despite this achievement, AI …

Leveraging explanations in interactive machine learning: An overview

S Teso, Ö Alkan, W Stammer, E Daly - Frontiers in Artificial …, 2023 - frontiersin.org
Explanations have attracted increasing interest in the AI and Machine Learning
(ML) communities as a means to improve model transparency and allow users to form a mental …

A perspective on explanations of molecular prediction models

GP Wellawatte, HA Gandhi, A Seshadri… - Journal of Chemical …, 2023 - ACS Publications
Chemists can be skeptical about using deep learning (DL) in decision making due to the lack of
interpretability in “black-box” models. Explainable artificial intelligence (XAI) is a branch of …

Meta-Sift: How to Sift Out a Clean Subset in the Presence of Data Poisoning?

Y Zeng, M Pan, H Jahagirdar, M Jin, L Lyu… - 32nd USENIX Security …, 2023 - usenix.org
External data sources are increasingly being used to train machine learning (ML) models as
data demands grow. However, the integration of external data into training poses …

Concept-level debugging of part-prototype networks

A Bontempelli, S Teso, K Tentori, F Giunchiglia… - arXiv preprint arXiv …, 2022 - arxiv.org
Part-prototype Networks (ProtoPNets) are concept-based classifiers designed to achieve the
same performance as black-box models without compromising transparency. ProtoPNets …

A rationale-centric framework for human-in-the-loop machine learning

J Lu, L Yang, B Mac Namee, Y Zhang - arXiv preprint arXiv:2203.12918, 2022 - arxiv.org
We present a novel rationale-centric framework with a human in the loop, Rationales-centric
Double-robustness Learning (RDL), to boost model out-of-distribution performance in few …

Studying How to Efficiently and Effectively Guide Models with Explanations

S Rao, M Böhle, A Parchami-Araghi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite being highly performant, deep neural networks might base their decisions on
features that spuriously correlate with the provided labels, thus hurting generalization. To …

A typology for exploring the mitigation of shortcut behaviour

F Friedrich, W Stammer, P Schramowski… - Nature Machine …, 2023 - nature.com
As machine learning models become larger, and are increasingly trained on large and
uncurated datasets in weakly supervised mode, it becomes important to establish …

Identifying spurious correlations and correcting them with an explanation-based learning

MT Hagos, KM Curran, B Mac Namee - arXiv preprint arXiv:2211.08285, 2022 - arxiv.org
Identifying the spurious correlations learned by a trained model is central to refining that
model and building a trustworthy one. We present a simple method to identify …

Targeted Activation Penalties Help CNNs Ignore Spurious Signals

D Zhang, M Williams, F Toni - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Neural networks (NNs) can learn to rely on spurious signals in the training data, leading to
poor generalisation. Recent methods tackle this problem by training NNs with additional …