查看文章

kavukcuoglu.org 中的 [PDF]

What is the best multi-stage architecture for object recognition?

作者

Kevin Jarrett, Koray Kavukcuoglu, Marc’Aurelio Ranzato, Yann LeCun

发表日期

2009/9/29

研讨会论文

Computer Vision, 2009. ICCV 2009. IEEE 12th International Conference on

页码范围

2146-2153

出版商

IEEE

简介

In many recent object recognition systems, feature extraction stages are generally composed of a filter bank, a non-linear transformation, and some sort of feature pooling layer. Most systems use only one stage of feature extraction in which the filters are hard-wired, or two stages where the filters in one or both stages are learned in supervised or unsupervised mode. This paper addresses three questions: 1. How does the non-linearities that follow the filter banks influence the recognition accuracy? 2. does learning the filter banks in an unsupervised or supervised manner improve the performance over random filters or hardwired filters? 3. Is there any advantage to using an architecture with two stages of feature extraction, rather than one? We show that using non-linearities that include rectification and local contrast normalization is the single most important ingredient for good accuracy on object recognition …

引用总数

被引用次数：3077

20102011201220132014201520162017201820192020202120222023202435 74 75 122 166 207 286 312 345 342 318 289 221 172 58

学术搜索中的文章

What is the best multi-stage architecture for object recognition?

K Jarrett, K Kavukcuoglu, MA Ranzato, Y LeCun - 2009 IEEE 12th international conference on computer …, 2009

被引用次数：3077 相关文章所有 23 个版本