Approxhadoop: Bringing approximations to mapreduce frameworks

I Goiri, R Bianchini, S Nagarakatte… - Proceedings of the …, 2015 - dl.acm.org
We propose and evaluate a framework for creating and running approximation-enabled
MapReduce programs. Specifically, we propose approximation mechanisms that fit naturally …

Towards automatic optimization of MapReduce programs

S Babu - Proceedings of the 1st ACM symposium on Cloud …, 2010 - dl.acm.org
Timely and cost-effective processing of large datasets has become a critical ingredient for
the success of many academic, government, and industrial organizations. The combination …

Google's MapReduce programming model—Revisited

R Lämmel - Science of computer programming, 2008 - Elsevier
Google's MapReduce programming model serves for processing large data sets in a
massively parallel manner. We deliver the first rigorous description of the model including its …

Automatic optimization for MapReduce programs

E Jahani, MJ Cafarella, C Ré - arXiv preprint arXiv:1104.3217, 2011 - arxiv.org
The MapReduce distributed programming framework has become popular, despite
evidence that current implementations are inefficient, requiring far more hardware than a …

MapReduce: simplified data processing on large clusters

J Dean, S Ghemawat - Communications of the ACM, 2008 - dl.acm.org
MapReduce is a programming model and an associated implementation for processing and
generating large datasets that is amenable to a broad variety of real-world tasks. Users …

[图书][B] MapReduce design patterns

D Miner, A Shook - 2012 - books.google.com
Until now, design patterns for the MapReduce framework have been scattered among
various research papers, blogs, and books. This handy guide brings together a unique …

Profiling, what-if analysis, and cost-based optimization of mapreduce programs

H Herodotou, S Babu - Proceedings of the VLDB Endowment, 2011 - dl.acm.org
MapReduce has emerged as a viable competitor to database systems in big data analytics.
MapReduce programs are being written for a wide variety of application domains including …

[PDF][PDF] Puma: Purdue mapreduce benchmarks suite

F Ahmad, S Lee, M Thottethodi, TN Vijaykumar - 2012 - docs.lib.purdue.edu
MapReduce [5] is a well-known programming model, developed within Google, for
processing large amounts of raw data such as crawled documents or web request logs on a …

Accelerating mapreduce on a coupled cpu-gpu architecture

L Chen, X Huo, G Agrawal - SC'12: Proceedings of the …, 2012 - ieeexplore.ieee.org
The work presented here is driven by two observations. First, heterogeneous architectures
that integrate a CPU and a GPU on the same chip are emerging, and hold much promise for …

Using realistic simulation for performance analysis of mapreduce setups

G Wang, AR Butt, P Pandey, K Gupta - … of the 1st ACM workshop on …, 2009 - dl.acm.org
Recently, there has been a huge growth in the amount of data processed by enterprises and
the scientific computing community. Two promising trends ensure that applications will be …