查看文章

mlr.press 中的 [PDF]

Deeply-supervised nets

作者

Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu

发表日期

2015/2/21

研讨会论文

Artificial intelligence and statistics

页码范围

562-570

出版商

Pmlr

简介

We propose deeply-supervised nets (DSN), a method that simultaneously minimizes classification error and improves the directness and transparency of the hidden layer learning process. We focus our attention on three aspects of traditional convolutional-neural-network-type (CNN-type) architectures:(1) transparency in the effect intermediate layers have on overall classification;(2) discriminativeness and robustness of learned features, especially in early layers;(3) training effectiveness in the face of “vanishing” gradients. To combat these issues, we introduce “companion” objective functions at each hidden layer, in addition to the overall objective function at the output layer (an integrated strategy distinct from layer-wise pre-training). We also analyze our algorithm using techniques extended from stochastic gradient methods. The advantages provided by our method are evident in our experimental results, showing state-of-the-art performance on MNIST, CIFAR-10, CIFAR-100, and SVHN.

引用总数

被引用次数：2776

2014201520162017201820192020202120222023202412 107 194 252 310 316 353 362 334 341 156

学术搜索中的文章

Deeply-supervised nets

CY Lee, S Xie, P Gallagher, Z Zhang, Z Tu - Artificial intelligence and statistics, 2015

被引用次数：2776 相关文章所有 14 个版本