Challenges and solutions for processing real-time big data stream: a systematic literature review

E Mehmood, T Anees - IEEE Access, 2020 - ieeexplore.ieee.org
Contribution: Recently, real-time data warehousing (DWH) and big data streaming have
become ubiquitous due to the fact that a number of business organizations are gearing up to …

A MapReduce task scheduling algorithm for deadline constraints

Z Tang, J Zhou, K Li, R Li - Cluster computing, 2013 - Springer
The current works about MapReduce task scheduling with deadline constraints neither take
the differences of Map and Reduce task, nor the cluster's heterogeneity into account. This …

Cache conscious star-join in MapReduce environments

G Zhou, Y Zhu, G Wang - Proceedings of the 2nd International Workshop …, 2013 - dl.acm.org
With the popularity of big data and cloud computing, data parallel framework MapReduce
based data warehouse systems are used widely. Column store is a default data placement …

[PDF][PDF] Cloud-aware processing of MapReduce-based OLAP applications

H Han, YC Lee, S Choi… - … of the Eleventh …, 2013 - crpit.scem.westernsydney.edu.au
As the volume of data to be processed in a timely manner soars, the scale of computing and
storage systems has much trouble keeping up with such a rate of explosive data growth. A …

Faster cloud star joins with reduced disk spill and network communication

JJ Brito, T Mosqueiro, RR Ciferri… - Procedia Computer …, 2016 - Elsevier
Combining powerful parallel frameworks and on-demand commodity hardware, cloud
computing has made both analytics and decision support systems canonical to enterprises …

Defining energy consumption plans for data querying processes

R Gonçalves, J Saraiva, O Belo - 2014 IEEE Fourth …, 2014 - ieeexplore.ieee.org
During the last few years, we have been witnessing a significant increase in research about
the development and production of hardware and software components with low levels of …

A distributed incremental information acquisition model for large-scale text data

S Sun, J Gong, AY Zomaya, A Wu - Cluster computing, 2019 - Springer
Timely discovering and acquiring information from incremental data on the Internet is a hot
topic in a big data era. This paper presents a distributed incremental information acquisition …

[PDF][PDF] 面向神威· 太湖之光的PETSc 可扩展异构并行算法及其性能优化

洪文杰, 李肯立, 全哲, 阳王东, 李克勤, 郝子宇… - 计算机学报, 2017 - cjc.ict.ac.cn
摘要共性数学库PETSc (Portable, Extensible Toolkit for Scientific Computation)
是高性能计算的基础模块, 是超级计算机计算环境的基础算法库之一, 其性能直接影响调用数学 …

A real-time scheduling strategy based on processing framework of Hadoop

F Chen, J Liu, Y Zhu - 2017 IEEE International Congress on Big …, 2017 - ieeexplore.ieee.org
Due to the batch processing capability and distributed storage, MapReduce and HDFS have
always been the core parts of Hadoop. Nowadays, many studies still focus on improving and …

Single‐scan: a fast star‐join query processing algorithm

V Purdilă, ŞG Pentiuc - Software: Practice and Experience, 2016 - Wiley Online Library
A data warehouse can store very large amounts of data that should be processed in parallel
in order to achieve reasonable query execution times. The MapReduce programming model …