FIRM: An Intelligent Fine-Grained Resource Management Framework for SLO-Oriented Microservices H Qiu, SS Banerjee, S Jha, ZT Kalbarczyk, RK Iyer Proceedings of The 14th USENIX Symposium on Operating Systems Design and …, 2020 | 221 | 2020 |
OWL: Understanding and Detecting Concurrency Attacks S Zhao, R Gu, H Qiu, TO Li, Y Wang, H Cui, J Yang Proceedings of The 48th IEEE/IFIP International Conference on Dependable …, 2018 | 26 | 2018 |
Reinforcement Learning for Resource Management in Multi-tenant Serverless Platforms H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ... Proceedings of the 2nd European Workshop on Machine Learning and Systems, 20-28, 2022 | 19 | 2022 |
PLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance C Wang, X Chen, W Jia, B Li, H Qiu, S Zhao, H Cui Proceedings of The 15th USENIX Symposium on Networked Systems Design and …, 2018 | 19 | 2018 |
AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems H Qiu, W Mao, C Wang, H Franke, A Youssef, ZT Kalbarczyk, T Başar, ... Proceedings of the 2023 USENIX Annual Technical Conference (ATC 2023), 2023 | 16 | 2023 |
A Mean-Field Game Approach to Cloud Resource Management with Function Approximation W Mao, H Qiu, C Wang, H Franke, ZT Kalbarczyk, RK Iyer, T Başar Proceedings of the Thirty-Sixth Conference on Neural Information Processing …, 2022 | 14 | 2022 |
SIMPPO: A Scalable and Incremental Online Learning Framework for Serverless Resource Management H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ... Proceedings of the 13th Symposium on Cloud Computing (SoCC 2022), 306-322, 2022 | 14 | 2022 |
Is Function-as-a-Service a Good Fit for Latency-Critical Services? H Qiu, S Jha, SS Banerjee, A Patke, C Wang, H Franke, ZT Kalbarczyk, ... Proceedings of The 7th International Workshop on Serverless Computing (WoSC7 …, 2021 | 10 | 2021 |
Pre-processed tracing data for popular microservice benchmarks H Qiu, SS Banerjee, S Jha, ZT Kalbarczyk, R Iyer | 6 | 2020 |
A geography-based P2P overlay network for fast and robust blockchain systems H Qiu, T Ji, S Zhao, X Chen, J Qi, H Cui, S Wang IEEE Transactions on Services Computing, 2022 | 5 | 2022 |
Delay sensitivity-driven congestion mitigation for HPC systems A Patke, S Jha, H Qiu, J Brandt, A Gentile, J Greenseid, Z Kalbarczyk, ... Proceedings of the ACM International Conference on Supercomputing, 342-353, 2021 | 5 | 2021 |
Evaluating hardware memory disaggregation under delay and contention A Patke, H Qiu, S Jha, S Venugopal, M Gazzetti, C Pinto, Z Kalbarczyk, ... 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 4 | 2022 |
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms H Qiu, W Mao, A Patke, S Cui, C Wang, H Franke, ZT Kalbarczyk, T Başar, ... The Seventh Annual Conference on Machine Learning and Systems (MLSys 2024), 2024 | 2 | 2024 |
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction H Qiu, W Mao, A Patke, S Cui, S Jha, C Wang, H Franke, ZT Kalbarczyk, ... The 5th International Workshop on Cloud Intelligence / AIOps (AIOps '24) Co …, 2024 | 1 | 2024 |
PALM: Adaptive Resource Allocation for Datacenter Power Capping H Qiu, L Zhang, C Wang, H Franke, ZT Kalbarczyk, RK Iyer Workshop on ML for Systems at NeurIPS 2023, 2023 | 1 | 2023 |
On the Promise and Challenges of Foundation Models for Learning-based Cloud Systems Management H Qiu, W Mao, C Wang, H Franke, ZT Kalbarczyk, T Başar, RK Iyer Workshop on ML for Systems at NeurIPS 2023, 2023 | 1 | 2023 |
SLO beyond the Hardware Isolation Limits H Qiu, Y Chen, T Xu, ZT Kalbarczyk, RK Iyer arXiv preprint arXiv:2109.11666, 2021 | 1 | 2021 |
Application-aware Congestion Mitigation for High-Performance Computing Systems A Patke, S Jha, H Qiu, J Brandt, A Gentile, J Greenseid, Z Kalbarczyk, ... arXiv preprint arXiv:2012.07755, 2020 | 1 | 2020 |
Power-aware Deep Learning Model Serving with μ-Serve H Qiu, W Mao, A Patke, S Cui, S Jha, C Wang, H Franke, ZT Kalbarczyk, ... USENIX Annual Technical Conference 2024 (ATC 2024), 2024 | | 2024 |
Queue Management for Large Language Model Serving A Patke, D Reddy, S Jha, C Pinto, H Qiu, S Cui, C Narayanaswami, ... The 5th International Workshop on Cloud Intelligence / AIOps (AIOps '24) Co …, 2024 | | 2024 |