In cloud computing jobs consisting of many tasks run in parallel, the tasks on the slowest machines (straggling tasks) become the bottleneck in the completion of the job. One way to …
In this paper, we aim at optimizing fault-tolerance techniques based on a checkpointing/restart mechanism, in the context of cloud computing. Our contribution is three …
Motivation: Distributed (computing) systems aim to atain scalability through parallel execution of multiple tasks constituting a job. Each of these tasks is run on a separate node …
Motivation: Distributed (computing) systems aim to attain scalability through parallel execution of multiple tasks constituting a job. Each task is run on a separate node, and the …
In this paper, we characterize and model Google applications and jobs, based on a 1-month Google trace from a large-scale Google data center. We address four contributions:(1) we …
The analysis of the energy efficiency in Cloud Computing infrastructures has become an important research domain as the utilization rate of the various on-demand services is daily …
A primary concern for cloud operators is to increase resource utilization while maintaining good performance for applications. This is particularly difficult to achieve for three reasons …
Recent advances in integrated photonics enable the implementation of reconfigurable, high- bandwidth, and low energy-per-bit interconnects in next-generation data centers. We …
Cloud computing is a paradigm that provides scalable IT resources as a service over the Internet. Vulnerabilities in the cloud infrastructure have been readily exploited by the …