Authors
Suraj Srinivas, Francois Fleuret
Publication date
2018/3/1
Conference
International Conference on Machine Learning (ICML)
Description
Classical distillation methods transfer representations from a “teacher” neural network to a “student” network by matching their output activations. Recent methods also match the Jacobians, i.e., the gradients of the output activations with respect to the input. However, this involves making some ad hoc decisions, in particular the choice of the loss function. In this paper, we first establish an equivalence between Jacobian matching and distillation with input noise, from which we derive appropriate loss functions for Jacobian matching. We then rely on this analysis to apply Jacobian matching to transfer learning, by establishing the equivalence of a recent transfer learning procedure to distillation. Finally, we show experimentally on standard image datasets that Jacobian-based penalties improve distillation, robustness to noisy inputs, and transfer learning.
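The input-noise equivalence mentioned in the description follows from a first-order Taylor argument. As a sketch (with s and t denoting a scalar student and teacher output and ξ Gaussian input noise; this notation is assumed here, not taken from the paper):

\mathbb{E}_{\xi \sim \mathcal{N}(0,\sigma^2 I)}\big[(s(x+\xi) - t(x+\xi))^2\big] \approx (s(x) - t(x))^2 + \sigma^2 \,\lVert \nabla_x s(x) - \nabla_x t(x) \rVert_2^2,

so matching outputs under small Gaussian input noise implicitly adds a squared-error penalty on the input Jacobians. Below is a minimal PyTorch sketch of distillation with such a Jacobian penalty; the squared-error terms, the max-logit projection of the Jacobian, and the weight alpha are illustrative assumptions, not the paper's exact loss.

import torch
import torch.nn.functional as F

def distill_with_jacobian(student, teacher, x, alpha=1.0):
    # Make the input a leaf tensor that tracks gradients,
    # so Jacobians w.r.t. the input can be computed.
    x = x.detach().requires_grad_(True)

    s_out = student(x)  # student logits, shape (batch, classes)
    t_out = teacher(x)  # teacher logits; grad w.r.t. x is needed below

    # Standard distillation term: match output activations.
    match_loss = F.mse_loss(s_out, t_out.detach())

    # Jacobian term: compare gradients of each sample's top logit
    # w.r.t. the input (a cheap projection of the full Jacobian).
    s_top = s_out.max(dim=1).values.sum()
    t_top = t_out.max(dim=1).values.sum()
    s_grad = torch.autograd.grad(s_top, x, create_graph=True)[0]
    t_grad = torch.autograd.grad(t_top, x)[0]
    jac_loss = F.mse_loss(s_grad, t_grad.detach())

    return match_loss + alpha * jac_loss

With a frozen teacher and an input batch x, loss = distill_with_jacobian(student, teacher, x) followed by loss.backward() propagates gradients only into the student, since all teacher terms are detached.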
Total citations
Per-year citations (histogram): 2018: 5 · 2019: 13 · 2020: 26 · 2021: 35 · 2022: 45 · 2023: 37 · 2024: 29