Knowledge distillation and student-teacher learning for visual intelligence: A review and new outlooks

L Wang, KJ Yoon - IEEE Transactions on Pattern Analysis and …, 2021 - ieeexplore.ieee.org
Deep neural models have, in recent years, been successful in almost every field, even
solving the most complex problems. However, these models are huge in size, with …

Ensemble deep learning in bioinformatics

Y Cao, TA Geddes, JYH Yang, P Yang - Nature Machine Intelligence, 2020 - nature.com
The remarkable flexibility and adaptability of ensemble methods and deep learning models
have led to the proliferation of their application in bioinformatics research. Traditionally …

Knowledge distillation: A survey

J Gou, B Yu, SJ Maybank, D Tao - International Journal of Computer Vision, 2021 - Springer
In recent years, deep neural networks have been successful in both industry and academia,
especially for computer vision tasks. The great success of deep learning is mainly due to its …

Comparison of CNN-based deep learning architectures for rice diseases classification

MT Ahad, Y Li, B Song, T Bhuiyan - Artificial Intelligence in Agriculture, 2023 - Elsevier
Although convolutional neural network (CNN) paradigms have expanded to transfer
learning and ensemble models from original individual CNN architectures, few studies have …
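
As a point of reference for the transfer-learning approach the snippet mentions, below is a minimal sketch (assuming PyTorch/torchvision and a hypothetical number of rice-disease classes; this is not the authors' exact pipeline) of reusing an ImageNet-pretrained CNN backbone and retraining only the classification head:

    import torch
    import torch.nn as nn
    from torchvision import models

    NUM_CLASSES = 9  # hypothetical number of rice-disease categories

    # Load an ImageNet-pretrained backbone and freeze its weights.
    model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
    for p in model.parameters():
        p.requires_grad = False
    # Replace the classifier head with a new, trainable layer for the target classes.
    model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)

    optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
    criterion = nn.CrossEntropyLoss()

    def train_step(images, labels):
        # One optimization step on a batch from a (hypothetical) leaf-image loader.
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
        return loss.item()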

Towards understanding ensemble, knowledge distillation and self-distillation in deep learning

Z Allen-Zhu, Y Li - arXiv preprint arXiv:2012.09816, 2020 - arxiv.org
We formally study how ensembles of deep learning models can improve test accuracy, and
how the superior performance of an ensemble can be distilled into a single model using …
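
A minimal sketch of the ensemble-distillation recipe the abstract refers to, in PyTorch: the student matches the averaged, temperature-softened predictions of several teachers, plus ordinary cross-entropy on the labels. Function and parameter names are illustrative, not the paper's notation.

    import torch
    import torch.nn.functional as F

    def ensemble_distillation_loss(student_logits, teacher_logits_list, labels,
                                   T=4.0, alpha=0.9):
        # Average the temperature-softened predictions of all teachers in the ensemble.
        teacher_probs = torch.stack(
            [F.softmax(t / T, dim=1) for t in teacher_logits_list]).mean(dim=0)
        # KL divergence between the student's softened prediction and the ensemble mean.
        kd = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                      teacher_probs, reduction="batchmean") * T * T
        # Ordinary cross-entropy on the ground-truth labels keeps the student grounded.
        ce = F.cross_entropy(student_logits, labels)
        return alpha * kd + (1 - alpha) * ce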

Group knowledge transfer: Federated learning of large cnns at the edge

C He, M Annavaram… - Advances in Neural …, 2020 - proceedings.neurips.cc
Scaling up the convolutional neural network (CNN) size (e.g., width, depth, etc.) is known to
effectively improve model accuracy. However, the large model size impedes training on …

Contrastive representation distillation

Y Tian, D Krishnan, P Isola - arXiv preprint arXiv:1910.10699, 2019 - arxiv.org
Often we wish to transfer representational knowledge from one neural network to another.
Examples include distilling a large network into a smaller one, transferring knowledge from …
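
A simplified sketch of the contrastive idea in PyTorch: each student embedding is pulled toward its own teacher embedding and pushed away from the other samples in the batch. The paper's actual objective uses a large memory bank of negatives and a learned projection; the in-batch version below is only illustrative.

    import torch
    import torch.nn.functional as F

    def contrastive_distillation_loss(student_feats, teacher_feats, temperature=0.1):
        # Assumes student and teacher features already share a dimension; in practice
        # a small learned projection maps the student into the teacher's space.
        s = F.normalize(student_feats, dim=1)   # (B, D) student embeddings
        t = F.normalize(teacher_feats, dim=1)   # (B, D) teacher embeddings
        logits = s @ t.t() / temperature        # (B, B) cosine-similarity matrix
        # Each row's positive is its own teacher embedding (the diagonal entry).
        targets = torch.arange(s.size(0), device=s.device)
        return F.cross_entropy(logits, targets)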

Dive into ambiguity: Latent distribution mining and pairwise uncertainty estimation for facial expression recognition

J She, Y Hu, H Shi, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Due to the subjective annotation and the inherent inter-class similarity of facial expressions,
one of the key challenges in Facial Expression Recognition (FER) is the annotation ambiguity …

Similarity-preserving knowledge distillation

F Tung, G Mori - Proceedings of the IEEE/CVF International …, 2019 - openaccess.thecvf.com
Knowledge distillation is a widely applicable technique for training a student neural
network under the guidance of a trained teacher network. For example, in neural network …
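
A minimal sketch of the similarity-preserving objective in PyTorch: batchwise pairwise-similarity matrices of student and teacher activations are matched at a given layer, so inputs that the teacher treats as similar stay similar for the student. Variable names and the single-layer setup are illustrative.

    import torch.nn.functional as F

    def similarity_preserving_loss(student_acts, teacher_acts):
        # Flatten per-sample activations and form batch-by-batch similarity matrices.
        b = student_acts.size(0)
        g_s = student_acts.view(b, -1) @ student_acts.view(b, -1).t()
        g_t = teacher_acts.view(b, -1) @ teacher_acts.view(b, -1).t()
        # Row-wise L2 normalization, then mean squared difference between the two.
        g_s = F.normalize(g_s, p=2, dim=1)
        g_t = F.normalize(g_t, p=2, dim=1)
        return ((g_s - g_t) ** 2).sum() / (b * b)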

On the efficacy of knowledge distillation

JH Cho, B Hariharan - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
In this paper, we present a thorough evaluation of the efficacy of knowledge distillation and
its dependence on student and teacher architectures. Starting with the observation that more …