Approximating cnns with bag-of-local-features models works surprisingly well on imagenet

R Geirhos, JH Jacobsen, C Michaelis… - Nature Machine …, 2020 - nature.com

Deep learning has triggered the current rise of artificial intelligence and is the workhorse of
today's machine intelligence. Numerous success stories have rapidly spread all over …

被引用次数：1783 相关文章所有 12 个版本

[PDF] ieee.org

Explaining deep neural networks and beyond: A review of methods and applications

W Samek, G Montavon, S Lapuschkin… - Proceedings of the …, 2021 - ieeexplore.ieee.org

With the broader and highly successful usage of machine learning (ML) in industry and the
sciences, there has been a growing demand for explainable artificial intelligence (XAI) …

被引用次数：936 相关文章所有 8 个版本

[HTML] sciencedirect.com

[HTML][HTML] Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence

S Ali, T Abuhmed, S El-Sappagh, K Muhammad… - Information fusion, 2023 - Elsevier

Artificial intelligence (AI) is currently being utilized in a wide range of sophisticated
applications, but the outcomes of many AI models are challenging to comprehend and trust …

被引用次数：297 相关文章所有 9 个版本

[PDF] thecvf.com

Scaling vision transformers to gigapixel images via hierarchical self-supervised learning

RJ Chen, C Chen, Y Li, TY Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract Vision Transformers (ViTs) and their multi-scale and hierarchical variations have
been successful at capturing image representations but their use has been generally …

被引用次数：311 相关文章所有 6 个版本

[PDF] thecvf.com

Scaling up your kernels to 31x31: Revisiting large kernel design in cnns

X Ding, X Zhang, J Han, G Ding - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …

被引用次数：729 相关文章所有 10 个版本

[PDF] openreview.net

When and why vision-language models behave like bags-of-words, and what to do about it?

M Yuksekgonul, F Bianchi, P Kalluri… - The Eleventh …, 2023 - openreview.net

Despite the success of large vision and language models (VLMs) in many downstream
applications, it is unclear how well they encode the compositional relationships between …

被引用次数：186 相关文章所有 3 个版本

[PDF] arxiv.org

Last layer re-training is sufficient for robustness to spurious correlations

P Kirichenko, P Izmailov, AG Wilson - arXiv preprint arXiv:2204.02937, 2022 - arxiv.org

Neural network classifiers can largely rely on simple spurious features, such as
backgrounds, to make predictions. However, even in these cases, we show that they still …

被引用次数：214 相关文章所有 6 个版本

[PDF] thecvf.com

Towards total recall in industrial anomaly detection

K Roth, L Pemula, J Zepeda… - Proceedings of the …, 2022 - openaccess.thecvf.com

Being able to spot defective parts is a critical component in large-scale industrial
manufacturing. A particular challenge that we address in this work is the cold-start problem …

被引用次数：627 相关文章所有 8 个版本

[PDF] neurips.cc

Intriguing properties of vision transformers

MM Naseer, K Ranasinghe, SH Khan… - Advances in …, 2021 - proceedings.neurips.cc

Vision transformers (ViT) have demonstrated impressive performance across numerous
machine vision tasks. These models are based on multi-head self-attention mechanisms that …

被引用次数：559 相关文章所有 11 个版本

[PDF] thecvf.com

Relational embedding for few-shot classification

D Kang, H Kwon, J Min, M Cho - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We propose to address the problem of few-shot classification by meta-learning" what to
observe" and" where to attend" in a relational perspective. Our method leverages relational …

被引用次数：202 相关文章所有 6 个版本

高级搜索

QQ 群