Scatter-gather-merge: An efficient star-join query processing algorithm for data-parallel frameworks

E Mehmood, T Anees - IEEE Access, 2020 - ieeexplore.ieee.org

Contribution: Recently, real-time data warehousing (DWH) and big data streaming have
become ubiquitous due to the fact that a number of business organizations are gearing up to …

被引用次数：77 相关文章所有 2 个版本

A MapReduce task scheduling algorithm for deadline constraints

Z Tang, J Zhou, K Li, R Li - Cluster computing, 2013 - Springer

The current works about MapReduce task scheduling with deadline constraints neither take
the differences of Map and Reduce task, nor the cluster's heterogeneity into account. This …

被引用次数：90 相关文章所有 6 个版本

[PDF] psu.edu

Cache conscious star-join in MapReduce environments

G Zhou, Y Zhu, G Wang - Proceedings of the 2nd International Workshop …, 2013 - dl.acm.org

With the popularity of big data and cloud computing, data parallel framework MapReduce
based data warehouse systems are used widely. Column store is a default data placement …

被引用次数：27 相关文章所有 3 个版本

[PDF] westernsydney.edu.au

[PDF][PDF] Cloud-aware processing of MapReduce-based OLAP applications

H Han, YC Lee, S Choi… - … of the Eleventh …, 2013 - crpit.scem.westernsydney.edu.au

As the volume of data to be processed in a timely manner soars, the scale of computing and
storage systems has much trouble keeping up with such a rate of explosive data growth. A …

被引用次数：29 相关文章所有 3 个版本

[PDF] sciencedirect.com

Faster cloud star joins with reduced disk spill and network communication

JJ Brito, T Mosqueiro, RR Ciferri… - Procedia Computer …, 2016 - Elsevier

Combining powerful parallel frameworks and on-demand commodity hardware, cloud
computing has made both analytics and decision support systems canonical to enterprises …

被引用次数：20 相关文章所有 8 个版本

Defining energy consumption plans for data querying processes

R Gonçalves, J Saraiva, O Belo - 2014 IEEE Fourth …, 2014 - ieeexplore.ieee.org

During the last few years, we have been witnessing a significant increase in research about
the development and production of hardware and software components with low levels of …

被引用次数：16 相关文章所有 3 个版本

A distributed incremental information acquisition model for large-scale text data

S Sun, J Gong, AY Zomaya, A Wu - Cluster computing, 2019 - Springer

Timely discovering and acquiring information from incremental data on the Internet is a hot
topic in a big data era. This paper presents a distributed incremental information acquisition …

被引用次数：13 相关文章所有 3 个版本

[PDF] ict.ac.cn

[PDF][PDF] 面向神威· 太湖之光的PETSc 可扩展异构并行算法及其性能优化

洪文杰，李肯立，全哲，阳王东，李克勤，郝子宇… - 计算机学报, 2017 - cjc.ict.ac.cn

摘要共性数学库PETSc (Portable, Extensible Toolkit for Scientific Computation)
是高性能计算的基础模块, 是超级计算机计算环境的基础算法库之一, 其性能直接影响调用数学 …

被引用次数：3 相关文章所有 4 个版本

A real-time scheduling strategy based on processing framework of Hadoop

F Chen, J Liu, Y Zhu - 2017 IEEE International Congress on Big …, 2017 - ieeexplore.ieee.org

Due to the batch processing capability and distributed storage, MapReduce and HDFS have
always been the core parts of Hadoop. Nowadays, many studies still focus on improving and …

被引用次数：8 相关文章所有 2 个版本

Single‐scan: a fast star‐join query processing algorithm

V Purdilă, ŞG Pentiuc - Software: Practice and Experience, 2016 - Wiley Online Library

A data warehouse can store very large amounts of data that should be processed in parallel
in order to achieve reasonable query execution times. The MapReduce programming model …

被引用次数：9 相关文章所有 4 个版本

高级搜索

QQ 群