Z Hao, J Guo, K Han, Y Tang, H Hu, Y Wang… - Proceedings of the 37th …, 2023 - dl.acm.org
Knowledge distillation (KD) has proven to be a highly effective approach for enhancing
model performance through a teacher-student training scheme. However, most existing …