Morphling: Fast, Near-Optimal Auto-Configuration for Cloud-Native Model Serving L Wang, L Yang, Y Yu, W Wang, B Li, X Sun, J He, L Zhang Proceedings of ACM Symposium on Cloud Computing (SoCC '21), 639–653, 2021 | 34 | 2021 |
Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent Q Weng, L Yang, Y Yu, W Wang, X Tang, G Yang, L Zhang 2023 USENIX Annual Technical Conference (USENIX ATC 23), 995-1008, 2023 | 10 | 2023 |
Workload Consolidation in Alibaba Clusters: The Good, the Bad, and the Ugly Y Zhang, Y Yu, W Wang, Q Chen, J Wu, Z Zhang, J Zhong, T Ding, ... Proceedings of ACM Symposium on Cloud Computing (SoCC '22), 210-225, 2022 | 7 | 2022 |