作者
Ren Li, Haibo Hu, Heng Li, Yunsong Wu, Jianxi Yang
发表日期
2016/8/1
期刊
International Journal of Parallel Programming
卷号
44
期号
4
页码范围
832-866
出版商
Springer US
简介
With the development of information technologies, we have entered the era of Big Data. Google’s MapReduce programming model and its open-source implementation in Apache Hadoop have become the dominant model for data-intensive processing because of its simplicity, scalability, and fault tolerance. However, several inherent limitations, such as lack of efficient scheduling and iteration computing mechanisms, seriously affect the efficiency and flexibility of MapReduce. To date, various approaches have been proposed to extend MapReduce model and improve runtime efficiency for different scenarios. In this review, we assess MapReduce to help researchers better understand these novel optimizations that have been taken to address its limitations. We first present the basic idea underlying MapReduce paradigm and describe several widely used open-source runtime systems. And then we discuss …
引用总数
2016201720182019202020212022202320247111123119695
学术搜索中的文章
R Li, H Hu, H Li, Y Wu, J Yang - International Journal of Parallel Programming, 2016