Nexus: a GPU cluster engine for accelerating DNN-based video analysis H Shen, L Chen, Y Jin, L Zhao, B Kong, M Philipose, A Krishnamurthy, ... Proceedings of the 27th ACM Symposium on Operating Systems Principles, 322-337, 2019 | 221* | 2019 |
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly Y Jin, T Zhou, L Zhao, Y Zhu, C Guo, M Canini, A Krishnamurthy arXiv preprint arXiv:2105.10762, 2021 | 25 | 2021 |
Efficient Direct-Connect Topologies for Collective Communications L Zhao, S Pal, T Chugh, W Wang, J Fantl, P Basu, J Khoury, ... arXiv preprint arXiv:2202.03356, 2022 | 7* | 2022 |
Efficient All-to-All Collective Communication Schedules for Direct-Connect Topologies S Pal, L Zhao, J Fantl, J Khoury, A Krishnamurthy, P Basu arXiv preprint arXiv:2309.13541, 2023 | 3 | 2023 |
Bandwidth Optimal Pipeline Schedule for Collective Communication L Zhao, A Krishnamurthy arXiv preprint arXiv:2305.18461, 2023 | 2 | 2023 |
ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics L Zhao, S Maleki, Z Yang, H Pourreza, A Shah, C Hwang, ... arXiv preprint arXiv:2402.06787, 2024 | 1 | 2024 |
Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem X Liu, B Arzani, SKR Kakarla, L Zhao, V Liu, M Castro, S Kandula, ... Proceedings of the ACM SIGCOMM 2024 Conference, 16-37, 2024 | | 2024 |
Nexus: A GPU Cluster Engine for Accelerating Neural Networks Based Video Analysis H Shen, L Chen, Y Jin, L Zhao, B Kong, M Philipose, A Krishnamurthy, ... | | |