Deep audio-visual learning: A survey

H Zhu, MD Luo, R Wang, AH Zheng, R He - International Journal of …, 2021 - Springer
Audio-visual learning, aimed at exploiting the relationship between audio and visual
modalities, has drawn considerable attention since deep learning started to be used …

A comprehensive survey on deep gait recognition: algorithms, datasets and challenges

C Shen, S Yu, J Wang, GQ Huang, L Wang - arXiv preprint arXiv …, 2022 - arxiv.org
Gait recognition aims at identifying a person at a distance through visual cameras. With the
emergence of deep learning, significant advancements in gait recognition have achieved …

Learning attention-guided pyramidal features for few-shot fine-grained recognition

H Tang, C Yuan, Z Li, J Tang - Pattern Recognition, 2022 - Elsevier
Few-shot fine-grained recognition (FS-FGR) aims to distinguish several highly similar
objects from different sub-categories with limited supervision. However, traditional few-shot …

Regionvit: Regional-to-local attention for vision transformers

CF Chen, R Panda, Q Fan - arXiv preprint arXiv:2106.02689, 2021 - arxiv.org
Vision transformer (ViT) has recently shown its strong capability in achieving comparable
results to convolutional neural networks (CNNs) on image classification. However, vanilla …

CABNet: Category attention block for imbalanced diabetic retinopathy grading

A He, T Li, N Li, K Wang, H Fu - IEEE Transactions on Medical …, 2020 - ieeexplore.ieee.org
Diabetic Retinopathy (DR) grading is challenging due to the presence of intra-class
variations, small lesions and imbalanced data distributions. The key for solving fine-grained …

A spatial feature-enhanced attention neural network with high-order pooling representation for application in pest and disease recognition

J Kong, H Wang, C Yang, X Jin, M Zuo, X Zhang - Agriculture, 2022 - mdpi.com
With the development of advanced information and intelligence technologies, precision
agriculture has become an effective solution to monitor and prevent crop pests and …

Boosting few-shot fine-grained recognition with background suppression and foreground alignment

Z Zha, H Tang, Y Sun, J Tang - IEEE Transactions on Circuits …, 2023 - ieeexplore.ieee.org
Few-shot fine-grained recognition (FS-FGR) aims to recognize novel fine-grained categories
with the help of limited available samples. Undoubtedly, this task inherits the main …

TransIFC: invariant cues-aware feature concentration learning for efficient fine-grained bird image classification

H Liu, C Zhang, Y Deng, B Xie, T Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Fine-grained bird image classification (FBIC) is not only meaningful for endangered bird
observation and protection but also a prevalent task for image classification in multimedia …

Context-aware attentional pooling (cap) for fine-grained visual classification

A Behera, Z Wharton, PRPG Hewage… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Deep convolutional neural networks (CNNs) have shown a strong ability in mining
discriminative object pose and parts information for image recognition. For fine-grained …

Aa-trans: Core attention aggregating transformer with information entropy selector for fine-grained visual classification

Q Wang, JJ Wang, H Deng, X Wu, Y Wang, G Hao - Pattern Recognition, 2023 - Elsevier
The task of fine-grained visual classification (FGVC) is to distinguish targets from
subordinate classifications. Since fine-grained images have the inherent characteristic of …