作者
Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, Li Fei-Fei
发表日期
2016
研讨会论文
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part III 14
页码范围
301-320
出版商
Springer International Publishing
简介
Current approaches for fine-grained recognition do the following: First, recruit experts to annotate a dataset of images, optionally also collecting more structured data in the form of part annotations and bounding boxes. Second, train a model utilizing this data. Toward the goal of solving fine-grained recognition, we introduce an alternative approach, leveraging free, noisy data from the web and simple, generic methods of recognition. This approach has benefits in both performance and scalability. We demonstrate its efficacy on four fine-grained datasets, greatly exceeding existing state of the art without the manual collection of even a single label, and furthermore show first results at scaling to more than 10,000 fine-grained categories. Quantitatively, we achieve top-1 accuracies of on CUB-200-2011, on Birdsnap, on FGVC-Aircraft, and on Stanford Dogs without using their …
引用总数
20152016201720182019202020212022202320242165773595658463617
学术搜索中的文章
J Krause, B Sapp, A Howard, H Zhou, A Toshev… - Computer Vision–ECCV 2016: 14th European …, 2016