作者
Wagner Kolberg, Pedro De B Marcos, Julio CS Anjos, Alexandre KS Miyazaki, Claudio R Geyer, Luciana B Arantes
发表日期
2013/5/31
期刊
Parallel Computing
卷号
39
期号
4
页码范围
233-244
出版商
North-Holland
简介
MapReduce is a parallel programming model to process large datasets, and it was inspired by the Map and Reduce primitives from functional languages. Its first implementation was designed to run on large clusters of homogeneous machines. Though, in the last years, the model was ported to different types of environments, such as desktop grid and volunteer computing. To obtain a good performance in these environments, however, it is necessary to adapt some framework mechanisms, such as scheduling and data distribution algorithms. In this paper we present the MRSG simulator, which reproduces the MapReduce work-flow on top of the SimGrid simulation toolkit, and provides an API to implement and evaluate these new algorithms and policies for MapReduce. To evaluate the simulator, we compared its behavior against a real Hadoop MapReduce deployment. The results show an important similarity …
引用总数
201320142015201620172018201920202021397967641
学术搜索中的文章
W Kolberg, PB Marcos, JCS Anjos, AKS Miyazaki… - Parallel Computing, 2013