Z Zhang, J Li, Z Wu, J Shen, J Xu - arXiv preprint arXiv:2407.13147, 2024 - arxiv.org
In recent years, current mainstream feature masking distillation methods mainly function by
reconstructing selectively masked regions of a student network from the feature maps of a …