Design tradeoffs in CXL-based memory pools for public cloud platforms

DS Berger, D Ernst, H Li, P Zardoshti, M Shah… - IEEE Micro, 2023 - ieeexplore.ieee.org
Dynamic random-access memory (DRAM) is a key driver of performance and cost in public
cloud servers. At the same time, a significant amount of DRAM is underutilized due to …

Is advance knowledge of flow sizes a plausible assumption?

V Ðukić, SA Jyothi, B Karlaš, M Owaida… - … USENIX Symposium on …, 2019 - usenix.org
Recent research has proposed several packet, flow, and coflow scheduling methods that
could substantially improve data center network performance. Most of this work assumes …

Model-driven cluster resource management for ai workloads in edge clouds

Q Liang, WA Hanafy, A Ali-Eldin, P Shenoy - ACM Transactions on …, 2023 - dl.acm.org
Since emerging edge applications such as Internet of Things (IoT) analytics and augmented
reality have tight latency constraints, hardware AI accelerators have been recently proposed …

Time series forecasting using facebook prophet for cloud resource management

M Daraghmeh, A Agarwal, R Manzano… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
The heterogeneous nature of workloads running in cloud environments makes future
resource usage prediction a complicated problem. Virtual machines can be described in five …

Cluster resource scheduling in cloud computing: literature review and research challenges

W Khallouli, J Huang - The Journal of supercomputing, 2022 - Springer
Scheduling plays a pivotal role in cloud computing systems. Designing an efficient
scheduler is a challenging task. The challenge comes from several aspects, including the …

Hindsight learning for mdps with exogenous inputs

SR Sinclair, FV Frujeri, CA Cheng… - International …, 2023 - proceedings.mlr.press
Many resource management problems require sequential decision-making under
uncertainty, where the only uncertainty affecting the decision outcomes are exogenous …

Characterizing co-located datacenter workloads: An alibaba case study

Y Cheng, Z Chai, A Anwar - Proceedings of the 9th asia-pacific …, 2018 - dl.acm.org
To improve resource utilization and thereby reduce costs, leading cloud infrastructure
operators such as Google and Alibaba co-locate transient batch jobs with long-running …

{Prediction-Based} power oversubscription in cloud platforms

AG Kumbhare, R Azimi, I Manousakis… - 2021 USENIX Annual …, 2021 - usenix.org
Prior work has used power capping to shave rare power peaks and add more servers to a
datacenter, thereby oversubscribing its resources and lowering capital costs. This works well …

Scavenger: A black-box batch workload resource manager for improving utilization in cloud environments

SA Javadi, A Suresh, M Wajahat, A Gandhi - Proceedings of the ACM …, 2019 - dl.acm.org
Resource under-utilization is common in cloud data centers. Prior works have proposed
improving utilization by running provider workloads in the background, colocated with tenant …

Redy: remote dynamic memory cache

Q Zhang, PA Bernstein, DS Berger… - arXiv preprint arXiv …, 2021 - arxiv.org
Redy is a cloud service that provides high performance caches using RDMA-accessible
remote memory. An application can customize the performance of each cache with a service …