MapReduce optimization using regulated dynamic prioritization

T Sandholm, K Lai - Proceedings of the eleventh international joint …, 2009 - dl.acm.org
We present a system for allocating resources in shared data and compute clusters that
improves MapReduce job scheduling in three ways. First, the system uses regulated and …

Resource-aware adaptive scheduling for MapReduce clusters

J Polo, C Castillo, D Carrera, Y Becerra… - Middleware 2011: ACM …, 2011 - Springer
We present a resource-aware scheduling technique for MapReduce multi-job workloads that
aims at improving resource utilization across machines while observing completion time …

Job scheduling for multi-user MapReduce clusters

M Zaharia, D Borthakur, JS Sarma… - … University of California …, 2009 - academia.edu
Sharing a MapReduce cluster between users is attractive because it enables statistical
multiplexing (lowering costs) and allows users to share a common large data set. However …

ARIA: automatic resource inference and allocation for MapReduce environments

A Verma, L Cherkasova, RH Campbell - Proceedings of the 8th ACM …, 2011 - dl.acm.org
MapReduce and Hadoop represent an economically compelling alternative for efficient
large scale data processing and advanced analytics in the enterprise. A key challenge in …

Two sides of a coin: Optimizing the schedule of MapReduce jobs to minimize their makespan and improve cluster performance

A Verma, L Cherkasova… - 2012 IEEE 20th …, 2012 - ieeexplore.ieee.org
Large-scale MapReduce clusters that routinely process petabytes of unstructured and
semi-structured data represent a new entity in the changing landscape of clouds. A key challenge …

Orchestrating an ensemble of MapReduce jobs for minimizing their makespan

A Verma, L Cherkasova… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Cloud computing offers an attractive option for businesses to rent a suitable size
MapReduce cluster, consume resources as a service, and pay only for resources that were …

Cost-effective resource provisioning for MapReduce in a cloud

B Palanisamy, A Singh, L Liu - IEEE Transactions on Parallel …, 2014 - ieeexplore.ieee.org
This paper presents a new MapReduce cloud service model, Cura, for provisioning
cost-effective MapReduce services in a cloud. In contrast to existing MapReduce cloud services …

Purlieus: locality-aware resource allocation for MapReduce in a cloud

B Palanisamy, A Singh, L Liu, B Jain - Proceedings of 2011 international …, 2011 - dl.acm.org
We present Purlieus, a MapReduce resource allocation system aimed at enhancing the
performance of MapReduce jobs in the cloud. Purlieus provisions virtual MapReduce …

Play it again, SimMR!

A Verma, L Cherkasova… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
A typical MapReduce cluster is shared among different users and multiple applications. A
challenging problem in such shared environments is the ability to efficiently control resource …

Balancing reducer skew in MapReduce workloads using progressive sampling

SR Ramakrishnan, G Swart, A Urmanov - Proceedings of the Third ACM …, 2012 - dl.acm.org
The elapsed time of a parallel job depends on the completion time of its longest running
constituent. We present a static load balancing algorithm that distributes work evenly across …