Adaptive resource efficient microservice deployment in cloud-edge continuum K Fu, W Zhang, Q Chen, D Zeng, M Guo IEEE Transactions on Parallel and Distributed Systems 33 (8), 1825-1840, 2021 | 73 | 2021 |
Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters W Zhang, W Cui, K Fu, Q Chen, DE Mawhirter, B Wu, C Li, M Guo Proceedings of the ACM international conference on supercomputing, 58-68, 2019 | 43 | 2019 |
Qos-aware and resource efficient microservice deployment in cloud-edge continuum K Fu, W Zhang, Q Chen, D Zeng, X Peng, W Zheng, M Guo 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 41 | 2021 |
Toward qos-awareness and improved utilization of spatial multitasking gpus W Zhang, Q Chen, N Zheng, W Cui, K Fu, M Guo IEEE Transactions on Computers 71 (4), 866-879, 2021 | 16 | 2021 |
Astraea: towards QoS-aware and resource-efficient multi-stage GPU services W Zhang, Q Chen, K Fu, N Zheng, Z Huang, J Leng, M Guo Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 14 | 2022 |
Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency J Shi, K Fu, Q Chen, C Yang, P Huang, M Zhou, J Zhao, C Chen, M Guo Proceedings of the 13th Symposium on Cloud Computing, 94-109, 2022 | 10 | 2022 |
QoS-awareness of microservices with excessive loads via inter-datacenter scheduling J Shi, J Wang, K Fu, Q Chen, D Zeng, M Guo 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 9 | 2022 |
QoS-aware irregular collaborative inference for improving throughput of DNN services K Fu, J Shi, Q Chen, N Zheng, W Zhang, D Zeng, M Guo SC22: International Conference for High Performance Computing, Networking …, 2022 | 7 | 2022 |
Charm: Collaborative host and accelerator resource management for gpu datacenters W Zhang, K Fu, N Zheng, Q Chen, C Li, W Zheng, M Guo 2021 IEEE 39th International Conference on Computer Design (ICCD), 307-315, 2021 | 7 | 2021 |
Nodens: Enabling Resource Efficient and Fast {QoS} Recovery of Dynamic Microservice Applications in Datacenters J Shi, H Zhang, Z Tong, Q Chen, K Fu, M Guo 2023 USENIX Annual Technical Conference (USENIX ATC 23), 403-417, 2023 | 5 | 2023 |
BLAD: Adaptive Load Balanced Scheduling and Operator Overlap Pipeline For Accelerating The Dynamic GNN Training K Fu, Q Chen, Y Yang, J Shi, C Li, M Guo Proceedings of the International Conference for High Performance Computing …, 2023 | 3 | 2023 |
Towards QoS-aware and resource-efficient GPU microservices based on spatial multitasking GPUs in datacenters W Zhang, Q Chen, K Fu, N Zheng, Z Huang, J Leng, C Li, W Zheng, ... arXiv preprint arXiv:2005.02088, 2020 | 3 | 2020 |