Deit iii: Revenge of the vit

H Touvron, M Cord, H Jégou - European conference on computer vision, 2022 - Springer
Abstract A Vision Transformer (ViT) is a simple neural architecture amenable to serve
several computer vision tasks. It has limited built-in architectural priors, in contrast to more …

Training data-efficient image transformers & distillation through attention

H Touvron, M Cord, M Douze, F Massa… - International …, 2021 - proceedings.mlr.press
Recently, neural networks purely based on attention were shown to address image
understanding tasks such as image classification. These high-performing vision …

Deep learning and computer vision will transform entomology

TT Høye, J Ärje, K Bjerge… - Proceedings of the …, 2021 - National Acad Sciences
Most animal species on Earth are insects, and recent reports suggest that their abundance is
in drastic decline. Although these reports come from a wide range of insect taxa and regions …

Three things everyone should know about vision transformers

H Touvron, M Cord, A El-Nouby, J Verbeek… - European Conference on …, 2022 - Springer
After their initial success in natural language processing, transformer architectures have
rapidly gained traction in computer vision, providing state-of-the-art results for tasks such as …

Feature transfer learning for face recognition with under-represented data

X Yin, X Yu, K Sohn, X Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Despite the large volume of face recognition datasets, there is a significant portion of
subjects, of which the samples are insufficient and thus under-represented. Ignoring such …

Distributional robustness loss for long-tail learning

D Samuel, G Chechik - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Real-world data is often unbalanced and long-tailed, but deep models struggle to recognize
rare classes in the presence of frequent classes. To address unbalanced data, most studies …

Masked modeling for self-supervised representation learning on vision and beyond

S Li, L Zhang, Z Wang, D Wu, L Wu, Z Liu, J Xia… - arXiv preprint arXiv …, 2023 - arxiv.org
As the deep learning revolution marches on, self-supervised learning has garnered
increasing attention in recent years thanks to its remarkable representation learning ability …

A convolutional neural network classifier identifies tree species in mixed-conifer forest from hyperspectral imagery

GA Fricker, JD Ventura, JA Wolf, MP North, FW Davis… - Remote Sensing, 2019 - mdpi.com
In this study, we automate tree species classification and mapping using field-based training
data, high spatial resolution airborne hyperspectral imagery, and a convolutional neural …

Action recognition with spatial-temporal discriminative filter banks

B Martinez, D Modolo, Y Xiong… - Proceedings of the …, 2019 - openaccess.thecvf.com
Action recognition has seen a dramatic performance improvement in the last few years. Most
of the current state-of-the-art literature either aims at improving performance through …

Wind turbine blade surface inspection based on deep learning and UAV-taken images

D Xu, C Wen, J Liu - Journal of Renewable and Sustainable Energy, 2019 - pubs.aip.org
As a key component of wind turbines (WTs), the blade conditions are related to the WT
normal operation and the WT blade inspection is a significant task. Most studies of WT blade …