The inaturalist species classification and detection dataset

G Van Horn, O Mac Aodha, Y Song… - Proceedings of the …, 2018 - openaccess.thecvf.com
Existing image classification datasets used in computer vision tend to have a uniform
distribution of images across object categories. In contrast, the natural world is heavily …

Large scale fine-grained categorization and domain-specific transfer learning

Y Cui, Y Song, C Sun, A Howard… - Proceedings of the …, 2018 - openaccess.thecvf.com
Transferring the knowledge learned from large scale datasets (eg, ImageNet) via fine-tuning
offers an effective solution for domain-specific fine-grained visual categorization (FGVC) …

Understanding neural networks through representation erasure

J Li, W Monroe, D Jurafsky - arXiv preprint arXiv:1612.08220, 2016 - arxiv.org
While neural networks have been successfully applied to many natural language processing
tasks, they come at the cost of interpretability. In this paper, we propose a general …

Bilinear CNN models for fine-grained visual recognition

TY Lin, A RoyChowdhury, S Maji - Proceedings of the IEEE …, 2015 - cv-foundation.org
We propose bilinear models, a recognition architecture that consists of two feature extractors
whose outputs are multiplied using outer product at each location of the image and pooled …

Part-stacked CNN for fine-grained visual categorization

S Huang, Z Xu, D Tao, Y Zhang - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
In the context of fine-grained visual categorization, the ability to interpret models as human-
understandable visual manuals is sometimes as important as achieving high classification …

Fine-grained recognition without part annotations

J Krause, H Jin, J Yang, L Fei-Fei - Proceedings of the IEEE …, 2015 - cv-foundation.org
Scaling up fine-grained recognition to all domains of fine-grained objects is a challenge the
computer vision community will need to face in order to realize its goal of recognizing all …

Cross-domain image retrieval with a dual attribute-aware ranking network

J Huang, RS Feris, Q Chen, S Yan - Proceedings of the IEEE …, 2015 - cv-foundation.org
We address the problem of cross-domain image retrieval, considering the following practical
application: given a user photo depicting a clothing image, our goal is to retrieve the same or …

The unreasonable effectiveness of noisy data for fine-grained recognition

J Krause, B Sapp, A Howard, H Zhou, A Toshev… - Computer Vision–ECCV …, 2016 - Springer
Current approaches for fine-grained recognition do the following: First, recruit experts to
annotate a dataset of images, optionally also collecting more structured data in the form of …

Multi-task CNN model for attribute prediction

AH Abdulnabi, G Wang, J Lu… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
This paper proposes a joint multi-task learning algorithm to better predict attributes in
images using deep convolutional neural networks (CNN). We consider learning binary …

The devil is in the tails: Fine-grained classification in the wild

G Van Horn, P Perona - arXiv preprint arXiv:1709.01450, 2017 - arxiv.org
The world is long-tailed. What does this mean for computer vision and visual recognition?
The main two implications are (1) the number of categories we need to consider in …