[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Deep face recognition: A survey

M Wang, W Deng - Neurocomputing, 2021 - Elsevier
Deep learning applies multiple processing layers to learn representations of data with
multiple levels of feature extraction. This emerging technique has reshaped the research …

[HTML][HTML] Multiscale feature extraction and fusion of image and text in VQA

S Lu, Y Ding, M Liu, Z Yin, L Yin, W Zheng - International Journal of …, 2023 - Springer
Abstract The Visual Question Answering (VQA) system is the process of finding useful
information from images related to the question to answer the question correctly. It can be …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Do adversarially robust imagenet models transfer better?

H Salman, A Ilyas, L Engstrom… - Advances in Neural …, 2020 - proceedings.neurips.cc
Transfer learning is a widely-used paradigm in deep learning, where models pre-trained on
standard datasets can be efficiently adapted to downstream tasks. Typically, better pre …

X-linear attention networks for image captioning

Y Pan, T Yao, Y Li, T Mei - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Recent progress on fine-grained visual recognition and visual question answering has
featured Bilinear Pooling, which effectively models the 2nd order interactions across multi …

Learning attention-guided pyramidal features for few-shot fine-grained recognition

H Tang, C Yuan, Z Li, J Tang - Pattern Recognition, 2022 - Elsevier
Few-shot fine-grained recognition (FS-FGR) aims to distinguish several highly similar
objects from different sub-categories with limited supervision. However, traditional few-shot …

Metransformer: Radiology report generation by transformer with multiple learnable expert tokens

Z Wang, L Liu, L Wang, L Zhou - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In clinical scenarios, multi-specialist consultation could significantly benefit the diagnosis,
especially for intricate cases. This inspires us to explore a" multi-expert joint diagnosis" …

A survey of deep learning-based object detection

L Jiao, F Zhang, F Liu, S Yang, L Li, Z Feng… - IEEE access, 2019 - ieeexplore.ieee.org
Object detection is one of the most important and challenging branches of computer vision,
which has been widely applied in people's life, such as monitoring security, autonomous …

Second-order attention network for single image super-resolution

T Dai, J Cai, Y Zhang, ST Xia… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Recently, deep convolutional neural networks (CNNs) have been widely explored in single
image super-resolution (SISR) and obtained remarkable performance. However, most of the …