相关文章- 学术资源搜索

[PDF][PDF] Reining in the outliers in {Map-Reduce} clusters using mantri

G Ananthanarayanan, S Kandula… - … USENIX Symposium on …, 2010 - usenix.org

Experience from an operational Map-Reduce cluster reveals that outliers significantly
prolong job completion. e causes for outliers include run-time contention for processor …

被引用次数：1002 相关文章所有 27 个版本

[PDF] psu.edu

Balancing reducer skew in MapReduce workloads using progressive sampling

SR Ramakrishnan, G Swart, A Urmanov - Proceedings of the Third ACM …, 2012 - dl.acm.org

The elapsed time of a parallel job depends on the completion time of its longest running
constituent. We present a static load balancing algorithm that distributes work evenly across …

被引用次数：129 相关文章所有 5 个版本

[PDF] illinois.edu

Breaking the MapReduce stage barrier

A Verma, B Cho, N Zea, I Gupta, RH Campbell - Cluster computing, 2013 - Springer

The MapReduce model uses a barrier between the Map and Reduce stages. This provides
simplicity in both programming and implementation. However, in many situations, this barrier …

被引用次数：106 相关文章所有 26 个版本

[PDF] researchgate.net

Clash of the titans: Mapreduce vs. spark for large scale data analytics

J Shi, Y Qiu, UF Minhas, L Jiao, C Wang… - Proceedings of the …, 2015 - dl.acm.org

MapReduce and Spark are two very popular open source cluster computing frameworks for
large scale data analytics. These frameworks hide the complexity of task parallelism and …

被引用次数：319 相关文章所有 11 个版本

[PDF] openproceedings.org

Adaptive MapReduce using situation-aware mappers

R Vernica, A Balmin, KS Beyer… - Proceedings of the 15th …, 2012 - dl.acm.org

We propose new adaptive runtime techniques for MapReduce that improve performance
and simplify job tuning. We implement these techniques by breaking a key assumption of …

被引用次数：96 相关文章所有 10 个版本

[PDF] researchgate.net

MapReduce optimization using regulated dynamic prioritization

T Sandholm, K Lai - Proceedings of the eleventh international joint …, 2009 - dl.acm.org

We present a system for allocating resources in shared data and compute clusters that
improves MapReduce job scheduling in three ways. First, the system uses regulated and …

被引用次数：260 相关文章所有 5 个版本

[PDF] audentia-gestion.fr

Scarlett: coping with skewed content popularity in mapreduce clusters

G Ananthanarayanan, S Agarwal, S Kandula… - Proceedings of the sixth …, 2011 - dl.acm.org

To improve data availability and resilience MapReduce frameworks use file systems that
replicate data uniformly. However, analysis of job logs from a large production cluster shows …

被引用次数：426 相关文章所有 13 个版本

Delay tails in MapReduce scheduling

J Tan, X Meng, L Zhang - Proceedings of the 12th ACM SIGMETRICS …, 2012 - dl.acm.org

MapReduce/Hadoop production clusters exhibit heavy-tailed characteristics for job
processing times. These phenomena are resultant of the workload features and the adopted …

被引用次数：121 相关文章所有 3 个版本

[PDF] academia.edu

Aria: automatic resource inference and allocation for mapreduce environments

A Verma, L Cherkasova, RH Campbell - Proceedings of the 8th ACM …, 2011 - dl.acm.org

MapReduce and Hadoop represent an economically compelling alternative for efficient
large scale data processing and advanced analytics in the enterprise. A key challenge in …

被引用次数：610 相关文章所有 12 个版本

[PDF] acm.org

MapReduce: simplified data processing on large clusters

J Dean, S Ghemawat - Communications of the ACM, 2008 - dl.acm.org

MapReduce is a programming model and an associated implementation for processing and
generating large datasets that is amenable to a broad variety of real-world tasks. Users …

被引用次数：23545 相关文章所有 86 个版本

高级搜索

QQ 群