A survey on green-energy-aware power management for datacenters

F Kong, X Liu - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Megawatt-scale datacenters have emerged to meet the increasing demand for IT
applications and services. The hunger for power brings large electricity bills to datacenter …

Using straggler replication to reduce latency in large-scale parallel computing

D Wang, G Joshi, G Wornell - ACM SIGMETRICS Performance …, 2015 - dl.acm.org
In cloud computing jobs consisting of many tasks run in parallel, the tasks on the slowest
machines (straggling tasks) become the bottleneck in the completion of the job. One way to …

Optimization of cloud task processing with checkpoint-restart mechanism

S Di, Y Robert, F Vivien, D Kondo, CL Wang… - Proceedings of the …, 2013 - dl.acm.org
In this paper, we aim at optimizing fault-tolerance techniques based on a
checkpointing/restart mechanism, in the context of cloud computing. Our contribution is three …

Effective straggler mitigation: Which clones should attack and when?

MF Aktas, P Peng, E Soljanin - ACM SIGMETRICS Performance …, 2017 - dl.acm.org
Motivation: Distributed (computing) systems aim to attain scalability through parallel
execution of multiple tasks constituting a job. Each of these tasks is run on a separate node …

Straggler mitigation by delayed relaunch of tasks

MF Aktas, P Peng, E Soljanin - ACM SIGMETRICS Performance …, 2018 - dl.acm.org
Motivation: Distributed (computing) systems aim to attain scalability through parallel
execution of multiple tasks constituting a job. Each task is run on a separate node, and the …

Characterizing and modeling cloud applications/jobs on a Google data center

S Di, D Kondo, F Cappello - The Journal of Supercomputing, 2014 - Springer
In this paper, we characterize and model Google applications and jobs, based on a 1-month
Google trace from a large-scale Google data center. We address four contributions: (1) we …

Green energy efficient scheduling management

I De Courchelle, T Guérout, G Da Costa… - … Modelling Practice and …, 2019 - Elsevier
The analysis of the energy efficiency in Cloud Computing infrastructures has become an
important research domain as the utilization rate of the various on-demand services is daily …

Skynet: Performance-driven resource management for dynamic workloads

Y Sfakianakis, M Marazakis… - 2021 IEEE 14th …, 2021 - ieeexplore.ieee.org
A primary concern for cloud operators is to increase resource utilization while maintaining
good performance for applications. This is particularly difficult to achieve for three reasons …

Optically connected memory for disaggregated data centers

J Gonzalez, MG Palma, M Hattink… - Journal of Parallel and …, 2022 - Elsevier
Recent advances in integrated photonics enable the implementation of reconfigurable,
high-bandwidth, and low energy-per-bit interconnects in next-generation data centers. We …

Controlled access to cloud resources for mitigating Economic Denial of Sustainability (EDoS) attacks

ZA Baig, SM Sait, F Binbeshr - Computer Networks, 2016 - Elsevier
Cloud computing is a paradigm that provides scalable IT resources as a service over the
Internet. Vulnerabilities in the cloud infrastructure have been readily exploited by the …