作者
Robert Fergus, Li Fei-Fei, Pietro Perona, Andrew Zisserman
发表日期
2005/10/17
研讨会论文
Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1
卷号
2
页码范围
1816-1823
出版商
IEEE
简介
Current approaches to object category recognition require datasets of training images to be manually prepared, with varying degrees of supervision. We present an approach that can learn an object category from just its name, by utilizing the raw output of image search engines available on the Internet. We develop a new model, TSI-pLSA, which extends pLSA (as applied to visual words) to include spatial information in a translation and scale invariant manner. Our approach can handle the high intra-class variability and large proportion of unrelated images returned by search engines. We evaluate tire models on standard test sets, showing performance competitive with existing methods trained on hand prepared datasets
引用总数
200520062007200820092010201120122013201420152016201720182019202020212022202320248306088105958385789263403328811141894
学术搜索中的文章
R Fergus, L Fei-Fei, P Perona, A Zisserman - Tenth IEEE International Conference on Computer …, 2005