PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search Y Xu, L Xie, X Zhang, X Chen, GJ Qi, Q Tian, H Xiong International Conference on Learning Representations, 2020 | 802 | 2020 |
Deep neural network compression with single and multiple level quantization Y Xu, Y Wang, A Zhou, W Lin, H Xiong Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 138 | 2018 |
TRP: Trained rank pruning for efficient deep neural networks Y Xu, Y Li, S Zhang, W Wen, B Wang, Y Qi, Y Chen, W Lin, H Xiong IJCAI 2020, 2020 | 127* | 2020 |
Weight-sharing neural architecture search: A battle to shrink the optimization gap L Xie, X Chen, K Bi, L Wei, Y Xu, L Wang, Z Chen, A Xiao, J Chang, ... ACM Computing Surveys (CSUR) 54 (9), 1-37, 2021 | 116* | 2021 |
Partially-connected neural architecture search for reduced computational redundancy Y Xu, L Xie, W Dai, X Zhang, X Chen, GJ Qi, H Xiong, Q Tian IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (9), 2953-2970, 2021 | 54 | 2021 |
QA-LoRA: Quantization-aware low-rank adaptation of large language models Y Xu, L Xie, X Gu, X Chen, H Chang, H Zhang, Z Chen, X Zhang, Q Tian ICLR 2024, 2023 | 45 | 2023 |
Latency-aware differentiable neural architecture search Y Xu, L Xie, X Zhang, X Chen, B Shi, Q Tian, H Xiong arXiv preprint arXiv:2001.06392, 2020 | 40 | 2020 |
Filter level pruning based on similar feature extraction for convolutional neural networks L Li, Y Xu, J Zhu IEICE Transactions on Information and Systems 101 (4), 1203-1206, 2018 | 26 | 2018 |
Fitting the search space of weight-sharing NAS with graph convolutional networks X Chen, L Xie, J Wu, L Wei, Y Xu, Q Tian Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7064-7072, 2021 | 19 | 2021 |
Iterative deep neural network quantization with Lipschitz constraint Y Xu, W Dai, Y Qi, J Zou, H Xiong IEEE Transactions on Multimedia 22 (7), 1874-1888, 2019 | 19 | 2019 |
BNET: Batch normalization with enhanced linear transformation Y Xu, L Xie, C Xie, W Dai, J Mei, S Qiao, W Shen, H Xiong, A Yuille IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (7), 9225-9232, 2023 | 12* | 2023 |
DNQ: Dynamic Network Quantization Y Xu, S Zhang, Y Qi, J Guo, W Lin, H Xiong Data Compression Conference (DCC2019), 2018 | 10 | 2018 |
Dynamic-stride-net: Deep convolutional neural network with dynamic stride Z Yang, Y Xu, W Dai, H Xiong Optoelectronic Imaging and Multimedia Technology VI 11187, 42-53, 2019 | 8 | 2019 |
Fedexg: Federated learning with model exchange Z Mao, W Dai, C Li, Y Xu, S Wang, J Zou, H Xiong 2020 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5, 2020 | 7 | 2020 |
Tiny-hourglassnet: An efficient design for 3D human pose estimation B Shi, Y Xu, W Dai, B Wang, S Zhang, C Li, J Zou, H Xiong 2020 IEEE International Conference on Image Processing (ICIP), 1491-1495, 2020 | 4 | 2020 |
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models X Lu, Q Liu, Y Xu, A Zhou, S Huang, B Zhang, J Yan, H Li ACL 2024, 2024 | 3 | 2024 |
Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding J Luo, S Li, W Dai, Y Xu, D Cheng, G Li, H Xiong 2020 Data Compression Conference (DCC), 33-42, 2020 | 3 | 2020 |
Feature map alignment: Towards efficient design of mixed-precision quantization scheme Y Bao, Y Xu, H Xiong 2019 IEEE Visual Communications and Image Processing (VCIP), 1-4, 2019 | 1 | 2019 |
One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments K Yi, Y Xu, H Chang, C Tang, Y Meng, T Zhang, J Li arXiv preprint arXiv:2405.20202, 2024 | | 2024 |
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models X Lu, A Zhou, Y Xu, R Zhang, P Gao, H Li ICML 2024, 2024 | | 2024 |