Authors
Hugo Larochelle, Yoshua Bengio, Jérôme Louradour, Pascal Lamblin
Publication date
2009/1/1
Journal
Journal of machine learning research
Volume
10
Issue
1
Abstract
Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization often appears to get stuck in poor solutions. Hinton et al. recently proposed a greedy layer-wise unsupervised learning procedure relying on the training algorithm of restricted Boltzmann machines (RBM) to initialize the parameters of a deep belief network (DBN), a generative model with many layers of hidden causal variables. This was followed by the proposal of another greedy layer-wise procedure, relying on the usage of autoassociator networks. In the context of the above optimization problem, we study these algorithms empirically to better understand their success. Our experiments confirm the hypothesis that the greedy layer-wise unsupervised training strategy helps the optimization by initializing weights in a region near a good local minimum, but also implicitly acts as a sort of regularization that brings better generalization and encourages internal distributed representations that are high-level abstractions of the input. We also present a series of experiments aimed at evaluating the link between the performance of deep neural networks and practical aspects of their topology, for example, demonstrating cases where the addition of more depth helps. Finally, we empirically explore simple variants of these training algorithms, such as the use of different RBM input unit distributions, a simple way of combining gradient estimators to improve …
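Below is a minimal sketch of the greedy layer-wise unsupervised pre-training strategy the abstract describes: each layer is trained as a restricted Boltzmann machine (RBM) with one-step contrastive divergence (CD-1), and its hidden representation becomes the input to the next layer. The code is illustrative only; all names (RBM, pretrain_stack, the learning rate and epoch counts) are assumptions for this sketch and do not come from the paper.

```python
# Sketch of greedy layer-wise RBM pre-training with CD-1, using plain NumPy.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Binary-binary restricted Boltzmann machine trained with CD-1."""
    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = np.random.default_rng(seed)
        self.W = 0.01 * self.rng.standard_normal((n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)   # visible biases
        self.b_h = np.zeros(n_hidden)    # hidden biases

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b_v)

    def cd1_update(self, v0, lr=0.05):
        # Positive phase: hidden activations driven by the data.
        ph0 = self.hidden_probs(v0)
        h0 = (self.rng.random(ph0.shape) < ph0).astype(float)
        # Negative phase: one Gibbs step (reconstruction).
        pv1 = self.visible_probs(h0)
        ph1 = self.hidden_probs(pv1)
        # Approximate log-likelihood gradient (contrastive divergence).
        batch = v0.shape[0]
        self.W  += lr * (v0.T @ ph0 - pv1.T @ ph1) / batch
        self.b_v += lr * (v0 - pv1).mean(axis=0)
        self.b_h += lr * (ph0 - ph1).mean(axis=0)

def pretrain_stack(data, layer_sizes, epochs=5, batch=64):
    """Greedily train one RBM per layer; each layer's hidden probabilities
    become the training data for the next layer. The resulting weights can
    initialize a deep network before supervised fine-tuning."""
    rbms, x = [], data
    for n_hidden in layer_sizes:
        rbm = RBM(x.shape[1], n_hidden)
        for _ in range(epochs):
            idx = np.random.permutation(len(x))
            for start in range(0, len(x), batch):
                rbm.cd1_update(x[idx[start:start + batch]])
        rbms.append(rbm)
        x = rbm.hidden_probs(x)   # propagate the representation upward
    return rbms

if __name__ == "__main__":
    # Toy binary data in place of real inputs such as MNIST digits.
    toy = (np.random.default_rng(1).random((256, 784)) < 0.1).astype(float)
    stack = pretrain_stack(toy, [256, 128])
    print([r.W.shape for r in stack])
```

In the paper's framing, this unsupervised stage initializes the weights of a deep network near a good region of parameter space and also acts as a regularizer; supervised fine-tuning of the whole stack would follow.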
Total citations
Cited yearly from 2009 through 2024 (per-year histogram counts not recoverable from the extracted text).
Scholar articles
H Larochelle, Y Bengio, J Louradour, P Lamblin - Journal of machine learning research, 2009