The yin and yang of power and performance for asymmetric hardware and managed software T Cao, SM Blackburn, T Gao, KS McKinley ACM SIGARCH Computer Architecture News 40 (3), 225-236, 2012 | 138 | 2012 |
Looking back on the language and hardware revolutions: measured power, performance, and scaling H Esmaeilzadeh, T Cao, Y Xi, SM Blackburn, KS McKinley ACM SIGARCH Computer Architecture News 39 (1), 319-332, 2011 | 113 | 2011 |
Parallel processing systems for big data: a survey Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos Proceedings of the IEEE 104 (11), 2114-2136, 2016 | 112 | 2016 |
Nn-meter: Towards accurate latency prediction of deep-learning model inference on diverse edge devices LL Zhang, S Han, J Wei, N Zheng, T Cao, Y Yang, Y Liu Proceedings of the 19th Annual International Conference on Mobile Systems …, 2021 | 102 | 2021 |
Panthera: Holistic memory management for big data processing over hybrid memories C Wang, H Cui, T Cao, J Zigman, H Volos, O Mutlu, F Lv, X Feng, GH Xu Proceedings of the 40th ACM SIGPLAN Conference on Programming Language …, 2019 | 68 | 2019 |
WADE: Writeback-aware dynamic cache management for NVM-based main memory system Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013 | 58 | 2013 |
Asymo: scalable and efficient deep-learning inference on asymmetric mobile cpus M Wang, S Ding, T Cao, Y Liu, F Xu Proceedings of the 27th Annual International Conference on Mobile Computing …, 2021 | 47 | 2021 |
Looking back and looking forward: power, performance, and upheaval H Esmaeilzadeh, T Cao, X Yang, SM Blackburn, KS McKinley Communications of the ACM 55 (7), 105-114, 2012 | 42 | 2012 |
CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices. F Jia, D Zhang, T Cao, S Jiang, Y Liu, J Ren, Y Zhang MobiSys 22, 209-221, 2022 | 32 | 2022 |
Portable performance on asymmetric multicore processors I Jibaja, T Cao, SM Blackburn, KS McKinley Proceedings of the 2016 International Symposium on Code Generation and …, 2016 | 29 | 2016 |
What is happening to power, performance, and software? H Esmaeilzadeh, T Cao, X Yang, S Blackburn, K McKinley IEEE Micro 32 (3), 110-121, 2012 | 26 | 2012 |
To bridge neural network design and real-world performance: A behaviour study for neural networks X Tang, S Han, LL Zhang, T Cao, Y Liu Proceedings of Machine Learning and Systems 3, 21-37, 2021 | 22 | 2021 |
TL-plane-based multi-core energy-efficient real-time scheduling algorithm for sporadic tasks D Zhang, D Guo, F Chen, F Wu, T Wu, T Cao, S Jin ACM Transactions on Architecture and Code Optimization (TACO) 8 (4), 1-20, 2012 | 18 | 2012 |
Efficient management for hybrid memory in managed language runtime C Wang, T Cao, J Zigman, F Lv, Y Zhang, X Feng Network and Parallel Computing: 13th IFIP WG 10.3 International Conference …, 2016 | 16 | 2016 |
Elasticvit: Conflict-aware supernet training for deploying fast vision transformer on diverse mobile devices C Tang, LL Zhang, H Jiang, J Xu, T Cao, Q Zhang, Y Yang, Z Wang, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 13 | 2023 |
Profiling and optimizing deep learning inference on mobile GPUs S Jiang, L Ran, T Cao, Y Xu, Y Liu Proceedings of the 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 75-81, 2020 | 12 | 2020 |
Integer or floating point? new outlooks for low-bit quantization on large language models Y Zhang, L Zhao, S Cao, W Wang, T Cao, F Yang, M Yang, S Zhang, N Xu arXiv preprint arXiv:2305.12356, 2023 | 11 | 2023 |
Romou: Rapidly generate high-performance tensor kernels for mobile gpus R Liang, T Cao, J Wen, M Wang, Y Wang, J Zou, Y Liu Proceedings of the 28th Annual International Conference on Mobile Computing …, 2022 | 11 | 2022 |
MobiDepth: Real-time depth estimation using on-device dual cameras J Zhang, H Yang, J Ren, D Zhang, B He, T Cao, Y Li, Y Zhang, Y Liu Proceedings of the 28th Annual International Conference on Mobile Computing …, 2022 | 10 | 2022 |
Unified holistic memory management supporting multiple big data processing frameworks over hybrid memories L Chen, J Zhao, C Wang, T Cao, J Zigman, H Volos, O Mutlu, F Lv, X Feng, ... ACM Transactions on Computer Systems (TOCS) 39 (1-4), 1-38, 2022 | 9 | 2022 |