ZeroQ: A novel zero shot quantization framework. Y Cai, Z Yao, Z Dong, A Gholami, MW Mahoney, K Keutzer. Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 410 | 2020 |
HAWQ-V2: Hessian aware trace-weighted quantization of neural networks. KK Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami, Michael ... arXiv preprint arXiv:1911.03852, 2019 | 262* | 2019 |
QuIP: 2-bit quantization of large language models with guarantees. J Chee, Y Cai, V Kuleshov, CM De Sa. Advances in Neural Information Processing Systems 36, 2024 | 69 | 2024 |
CoDeNet: Efficient deployment of input-adaptive object detection on embedded FPGAs. Q Huang, D Wang, Z Dong, Y Gao, Y Cai, T Li, B Wu, K Keutzer, ... The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays …, 2021 | 59* | 2021 |
Structured Pruning of CNNs at Initialization. Y Cai, W Hua, H Chen, GE Suh, C De Sa, Z Zhang | 16* | |
SPADE: A spectral method for black-box adversarial robustness evaluation. W Cheng, C Deng, Z Zhao, Y Cai, Z Zhang, Z Feng. International Conference on Machine Learning, 1814-1824, 2021 | 13 | 2021 |
A comprehensive evaluation of FPGA-based spatial acceleration of LLMs. H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang. Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024 | 10* | 2024 |
Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs. D Dai, Y Zhang, J Zhang, Z Hu, Y Cai, Q Sun, Z Zhang. arXiv preprint arXiv:2401.17544, 2024 | | 2024 |