A survey on deep learning-based fine-grained object classification and semantic segmentation

B Zhao, J Feng, X Wu, S Yan - International Journal of Automation and …, 2017 - Springer
The deep learning technology has shown impressive performance in various vision tasks
such as image classification, object detection and semantic segmentation. In particular …

Counterfactual attention learning for fine-grained visual categorization and re-identification

Y Rao, G Chen, J Lu, J Zhou - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Attention mechanism has demonstrated great potential in fine-grained visual recognition
tasks. In this paper, we present a counterfactual attention learning method to learn more …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Concept whitening for interpretable image recognition

Z Chen, Y Bei, C Rudin - Nature Machine Intelligence, 2020 - nature.com
What does a neural network encode about a concept as we traverse through the layers?
Interpretability in machine learning is undoubtedly important, but the calculations of neural …

Picanet: Learning pixel-wise contextual attention for saliency detection

N Liu, J Han, MH Yang - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Contexts play an important role in the saliency detection task. However, given a context
region, not all contextual information is helpful for the final task. In this paper, we propose a …

A new image classification method using CNN transfer learning and web data augmentation

D Han, Q Liu, W Fan - Expert Systems with Applications, 2018 - Elsevier
Abstract Since Convolutional Neural Network (CNN) won the image classification
competition 202 (ILSVRC12), a lot of attention has been paid to deep layer CNN study. The …

Selective sparse sampling for fine-grained image recognition

Y Ding, Y Zhou, Y Zhu, Q Ye… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Fine-grained recognition poses the unique challenge of capturing subtle inter-class
differences under considerable intra-class variances (eg, beaks for bird species) …

Spatial transformer networks

M Jaderberg, K Simonyan… - Advances in neural …, 2015 - proceedings.neurips.cc
Abstract Convolutional Neural Networks define an exceptionallypowerful class of model, but
are still limited by the lack of abilityto be spatially invariant to the input data in a …

Draw: A recurrent neural network for image generation

K Gregor, I Danihelka, A Graves… - International …, 2015 - proceedings.mlr.press
This paper introduces the Deep Recurrent Attentive Writer (DRAW) architecture for image
generation with neural networks. DRAW networks combine a novel spatial attention …

End-to-end learning of action detection from frame glimpses in videos

S Yeung, O Russakovsky, G Mori… - Proceedings of the …, 2016 - openaccess.thecvf.com
In this work we introduce a fully end-to-end approach for action detection in videos that
learns to directly predict the temporal bounds of actions. Our intuition is that the process of …