Large-scale cluster management at Google with Borg

A Verma, L Pedrosa, M Korupolu… - Proceedings of the …, 2015 - dl.acm.org
Google's Borg system is a cluster manager that runs hundreds of thousands of jobs, from
many thousands of different applications, across a number of clusters each with up to tens of …

Who limits the resource efficiency of my datacenter: An analysis of Alibaba datacenter traces

J Guo, Z Chang, S Wang, H Ding, Y Feng… - Proceedings of the …, 2019 - dl.acm.org
Cloud platform provides great flexibility and cost-efficiency for end-users and cloud
operators. However, low resource utilization in modern datacenters brings huge wastes of …

Imbalance in the cloud: An analysis on Alibaba cluster trace

C Lu, K Ye, G Xu, CZ Xu, T Bai - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
To improve resource efficiency and design intelligent scheduler for clouds, it is necessary to
understand the workload characteristics and machine utilization in large-scale cloud data …

A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment

Y Zhu, W Zhang, Y Chen, H Gao - EURASIP Journal on Wireless …, 2019 - Springer
Server workload in the form of cloud-end clusters is a key factor in server maintenance and
task scheduling. How to balance and optimize hardware resources and computation …

The elasticity and plasticity in semi-containerized co-locating cloud workload: A view from Alibaba trace

Q Liu, Z Yu - Proceedings of the ACM Symposium on Cloud …, 2018 - dl.acm.org
Cloud computing with large-scale datacenters provides great convenience and
cost-efficiency for end users. However, the resource utilization of cloud datacenters is very low …

Is advance knowledge of flow sizes a plausible assumption?

V Ðukić, SA Jyothi, B Karlaš, M Owaida… - … USENIX Symposium on …, 2019 - usenix.org
Recent research has proposed several packet, flow, and coflow scheduling methods that
could substantially improve data center network performance. Most of this work assumes …

Security supportive energy-aware scheduling and energy policies for cloud environments

D Fernández-Cerero, A Jakóbik, D Grzonka… - Journal of Parallel and …, 2018 - Elsevier
Cloud computing (CC) systems are the most popular computational environments for
providing elastic and scalable services on a massive scale. The nature of such systems …

SCORE: Simulator for cloud optimization of resources and energy consumption

D Fernández-Cerero, A Fernández-Montes… - … Modelling Practice and …, 2018 - Elsevier
Achieving efficiency both in terms of resource utilisation and energy consumption is a
complex challenge, especially in large-scale wide-purpose data centers that serve cloud …

Cloud failure prediction based on traditional machine learning and deep learning

TN Tengku Asmawi, A Ismail, J Shen - Journal of Cloud Computing, 2022 - Springer
Cloud failure is one of the critical issues since it can cost millions of dollars to cloud service
providers, in addition to the loss of productivity suffered by industrial users. Fault tolerance …

Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency

J Shi, K Fu, Q Chen, C Yang, P Huang, M Zhou… - Proceedings of the 13th …, 2022 - dl.acm.org
Cloud providers often build a geo-distributed cloud from multiple datacenters in different
geographic regions, to serve tenants at different locations. The tenants that run large scale …