The inaturalist challenge 2017 dataset

H Touvron, M Cord, H Jégou - European conference on computer vision, 2022 - Springer

Abstract A Vision Transformer (ViT) is a simple neural architecture amenable to serve
several computer vision tasks. It has limited built-in architectural priors, in contrast to more …

被引用次数：295 相关文章所有 8 个版本

[PDF] mlr.press

Training data-efficient image transformers & distillation through attention

H Touvron, M Cord, M Douze, F Massa… - International …, 2021 - proceedings.mlr.press

Recently, neural networks purely based on attention were shown to address image
understanding tasks such as image classification. These high-performing vision …

被引用次数：6321 相关文章所有 6 个版本

[PDF] pnas.org Full View

Deep learning and computer vision will transform entomology

TT Høye, J Ärje, K Bjerge… - Proceedings of the …, 2021 - National Acad Sciences

Most animal species on Earth are insects, and recent reports suggest that their abundance is
in drastic decline. Although these reports come from a wide range of insect taxa and regions …

被引用次数：314 相关文章所有 17 个版本

[PDF] arxiv.org

Three things everyone should know about vision transformers

H Touvron, M Cord, A El-Nouby, J Verbeek… - European Conference on …, 2022 - Springer

After their initial success in natural language processing, transformer architectures have
rapidly gained traction in computer vision, providing state-of-the-art results for tasks such as …

被引用次数：91 相关文章所有 6 个版本

[PDF] thecvf.com

Feature transfer learning for face recognition with under-represented data

X Yin, X Yu, K Sohn, X Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com

Despite the large volume of face recognition datasets, there is a significant portion of
subjects, of which the samples are insufficient and thus under-represented. Ignoring such …

被引用次数：340 相关文章所有 9 个版本

[PDF] thecvf.com

Distributional robustness loss for long-tail learning

D Samuel, G Chechik - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Real-world data is often unbalanced and long-tailed, but deep models struggle to recognize
rare classes in the presence of frequent classes. To address unbalanced data, most studies …

被引用次数：105 相关文章所有 6 个版本

[PDF] arxiv.org

Masked modeling for self-supervised representation learning on vision and beyond

S Li, L Zhang, Z Wang, D Wu, L Wu, Z Liu, J Xia… - arXiv preprint arXiv …, 2023 - arxiv.org

As the deep learning revolution marches on, self-supervised learning has garnered
increasing attention in recent years thanks to its remarkable representation learning ability …

被引用次数：6 相关文章所有 2 个版本

[PDF] mdpi.com

A convolutional neural network classifier identifies tree species in mixed-conifer forest from hyperspectral imagery

GA Fricker, JD Ventura, JA Wolf, MP North, FW Davis… - Remote Sensing, 2019 - mdpi.com

In this study, we automate tree species classification and mapping using field-based training
data, high spatial resolution airborne hyperspectral imagery, and a convolutional neural …

被引用次数：176 相关文章所有 15 个版本

[PDF] thecvf.com

Action recognition with spatial-temporal discriminative filter banks

B Martinez, D Modolo, Y Xiong… - Proceedings of the …, 2019 - openaccess.thecvf.com

Action recognition has seen a dramatic performance improvement in the last few years. Most
of the current state-of-the-art literature either aims at improving performance through …

被引用次数：89 相关文章所有 6 个版本

Wind turbine blade surface inspection based on deep learning and UAV-taken images

D Xu, C Wen, J Liu - Journal of Renewable and Sustainable Energy, 2019 - pubs.aip.org

As a key component of wind turbines (WTs), the blade conditions are related to the WT
normal operation and the WT blade inspection is a significant task. Most studies of WT blade …

被引用次数：52 相关文章所有 2 个版本

高级搜索

QQ 群