作者
Tian Zhang, Dongliang Chang, Zhanyu Ma, Jun Guo
发表日期
2021/12/5
研讨会论文
2021 International Conference on Visual Communications and Image Processing (VCIP)
页码范围
1-5
出版商
IEEE
简介
Fine-grained visual classification aims to recognize images belonging to multiple sub-categories within a same category. It is a challenging task due to the inherently subtle variations among highly-confused categories. Most existing methods only take an individual image as input, which may limit the ability of models to recognize contrastive clues from different images. In this paper, we propose an effective method called progressive co-attention network (PCA-Net) to tackle this problem. Specifically, we calculate the channel-wise similarity by encouraging interaction between the feature channels within same-category image pairs to capture the common discriminative features. Considering that complementary information is also crucial for recognition, we erase the prominent areas enhanced by the channel interaction to force the network to focus on other discriminative regions. The proposed model has achieved …
引用总数
学术搜索中的文章
T Zhang, D Chang, Z Ma, J Guo - … Conference on Visual Communications and Image …, 2021