Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks

A Fernández, S del Río, V López… - … : Data Mining and …, 2014 - Wiley Online Library
The term 'Big Data'has spread rapidly in the framework of Data Mining and Business
Intelligence. This new scenario can be defined by means of those problems that cannot be …

A survey on graphic processing unit computing for large‐scale data mining

A Cano - Wiley Interdisciplinary Reviews: Data Mining and …, 2018 - Wiley Online Library
General purpose computation using Graphic Processing Units (GPUs) is a well‐established
research area focusing on high‐performance computing solutions for massively …

MapReduce parallel programming model: a state-of-the-art survey

R Li, H Hu, H Li, Y Wu, J Yang - International Journal of Parallel …, 2016 - Springer
With the development of information technologies, we have entered the era of Big Data.
Google's MapReduce programming model and its open-source implementation in Apache …

Sparkcl: A unified programming framework for accelerators on heterogeneous clusters

O Segal, P Colangelo, N Nasiri, Z Qian… - arXiv preprint arXiv …, 2015 - arxiv.org
We introduce SparkCL, an open source unified programming framework based on Java,
OpenCL and the Apache Spark framework. The motivation behind this work is to bring …

GPU in-memory processing using spark for iterative computation

S Hong, W Choi, WK Jeong - 2017 17th IEEE/ACM …, 2017 - ieeexplore.ieee.org
Due to its simplicity and scalability, MapReduce has become a de facto standard computing
model for big data processing. Since the original MapReduce model was only appropriate …

Vispark: GPU-accelerated distributed visual computing using spark

W Choi, S Hong, WK Jeong - SIAM Journal on Scientific Computing, 2016 - SIAM
With the growing need of big-data processing in diverse application domains, MapReduce
(eg, Hadoop) has become one of the standard computing paradigms for large-scale …

Hadoop extensions for distributed computing on reconfigurable active SSD clusters

A Kaitoua, H Hajj, MAR Saghir, H Artail… - ACM Transactions on …, 2014 - dl.acm.org
In this article, we propose new extensions to Hadoop to enable clusters of reconfigurable
active solid-state drives (RASSDs) to process streaming data from SSDs using FPGAs. We …

High level programming for heterogeneous architectures

O Segal, M Margala, SR Chalamalasetti… - arXiv preprint arXiv …, 2014 - arxiv.org
This work presents an effort to bridge the gap between abstract high level programming and
OpenCL by extending an existing high level Java programming framework (APARAPI) …

Gpu-accelerated cloud computing for data-intensive applications

B Zhao, J Zhong, B He, Q Luo, W Fang… - Cloud Computing for …, 2014 - Springer
Recently, many large-scale data-intensive applications have emerged from the Internet and
science domains. They pose significant challenges on the performance, scalability and …

Power efficient mapreduce workload acceleration using integrated-gpu

SY Kim, J Bottleson, J Jin, P Bindu… - 2015 IEEE First …, 2015 - ieeexplore.ieee.org
With the pervasiveness of MapReduce-one of the most prominent programming models for
data parallelism in Apache Hadoop-, many researchers and developers have spent …