Delay tails in MapReduce scheduling

J Tan, X Meng, L Zhang - Proceedings of the 12th ACM SIGMETRICS …, 2012 - dl.acm.org
MapReduce/Hadoop production clusters exhibit heavy-tailed characteristics for job
processing times. These phenomena are resultant of the workload features and the adopted …

Deadline-aware MapReduce job scheduling with dynamic resource availability

D Cheng, X Zhou, Y Xu, L Liu… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
As MapReduce is becoming ubiquitous in large-scale data analysis, many recent studies
have shown that the performance of MapReduce could be improved by different job …

Dynamicmr: A dynamic slot allocation optimization framework for mapreduce clusters

S Tang, BS Lee, B He - IEEE Transactions on Cloud …, 2014 - ieeexplore.ieee.org
MapReduce is a popular computing paradigm for large-scale data processing in cloud
computing. However, the slot-based MapReduce system (eg, Hadoop MRv1) can suffer from …

Natjam: Design and evaluation of eviction policies for supporting priorities and deadlines in mapreduce clusters

B Cho, M Rahman, T Chajed, I Gupta, C Abad… - Proceedings of the 4th …, 2013 - dl.acm.org
This paper presents Natjam, a system that supports arbitrary job priorities, hard real-time
scheduling, and efficient preemption for Mapreduce clusters that are resource-constrained …

Aria: automatic resource inference and allocation for mapreduce environments

A Verma, L Cherkasova, RH Campbell - Proceedings of the 8th ACM …, 2011 - dl.acm.org
MapReduce and Hadoop represent an economically compelling alternative for efficient
large scale data processing and advanced analytics in the enterprise. A key challenge in …

A load-aware scheduler for MapReduce framework in heterogeneous cloud environments

HH You, CC Yang, JL Huang - … of the 2011 ACM Symposium on Applied …, 2011 - dl.acm.org
MapReduce is becoming a popular programming model for large-scale data processing in
cloud computing environments. Hadoop MapReduce is the most popular open-source …

LsPS: A job size-based scheduler for efficient task assignments in Hadoop

Y Yao, J Tai, B Sheng, N Mi - IEEE Transactions on Cloud …, 2014 - ieeexplore.ieee.org
The MapReduce paradigm and its open source implementation Hadoop are emerging as an
important standard for large-scale data-intensive processing in both industry and academia …

Enhancement of Xen's scheduler for MapReduce workloads

H Kang, Y Chen, JL Wong, R Sion, J Wu - Proceedings of the 20th …, 2011 - dl.acm.org
As the trends move towards data outsourcing and cloud computing, the efficiency of
distributed data centers increases in importance. Cloud-based services such as Amazon's …

Performance analysis of coupling scheduler for mapreduce/hadoop

J Tan, X Meng, L Zhang - 2012 Proceedings IEEE INFOCOM, 2012 - ieeexplore.ieee.org
For MapReduce/Hadoop, map and reduce phases exhibit fundamentally distinguishing
characteristics. Additionally, these two phases admit complicated and tight dependency on …

[PDF][PDF] Job scheduling for multi-user mapreduce clusters

M Zaharia, D Borthakur, JS Sarma… - … , Tech. Rep. UCB …, 2009 - digitalassets.lib.berkeley.edu
Sharing a MapReduce cluster between users is attractive because it enables statistical
multiplexing (lowering costs) and allows users to share a common large data set. However …