The rise of “big data” on cloud computing: Review and open research issues

IAT Hashem, I Yaqoob, NB Anuar, S Mokhtar, A Gani… - Information systems, 2015 - Elsevier
Cloud computing is a powerful technology to perform massive-scale and complex
computing. It eliminates the need to maintain expensive computing hardware, dedicated …

Pagrol: Parallel graph olap over large-scale attributed graphs

Z Wang, Q Fan, H Wang, KL Tan… - 2014 IEEE 30th …, 2014 - ieeexplore.ieee.org
Attributed graphs are becoming important tools for modeling information networks, such as
the Web and various social networks (eg Facebook, LinkedIn, Twitter). However, it is …

Lightweight Materialization for Fast Dashboards Over Joins

Z Huang, E Wu - Proceedings of the ACM on Management of Data, 2023 - dl.acm.org
Dashboards are vital in modern business intelligence tools, providing non-technical users
with an interface to access comprehensive business data. With the rise of cloud technology …

Big data analysis on clouds

L Belcastro, F Marozzo, D Talia, P Trunfio - Handbook of big data …, 2017 - Springer
The huge amount of data generated, the speed at which it is produced, and its heterogeneity
in terms of format, represent a challenge to the current storage, process and analysis …

[PDF][PDF] Challenges in Big Data Cloud Computing And Future Research Prospects: A Review: A Review

SK Majhi, G Shial - SmartCR, 2015 - researchgate.net
Data management is becoming an increasingly important component of analytics in the age
of BigData. Because of the data generated by the number of companies is increasing in …

Multidimensional array data management

F Rusu - Foundations and Trends® in Databases, 2023 - nowpublishers.com
Multidimensional arrays are a fundamental abstraction to represent data across scientific
domains ranging from astronomy to genetics, medicine, business intelligence, and …

A simple low cost parallel architecture for big data analytics

C Ordonez, ST Al-Amin, X Zhou - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Big Data Systems (Hadoop, DBMSs) require a complicated setup and tuning to store and
process big data on a parallel cluster. This is mainly due to static partitioning when data sets …

Distributed graph cube generation using Spark framework

S Kang, S Lee, J Kim - The Journal of Supercomputing, 2020 - Springer
Graph OLAP is a technology that generates aggregates or summaries of a large-scale graph
based on the properties (or dimensions) associated with its nodes and edges, and in turn …

Calibration: A Simple Trick for Wide-table Delta Analytics

Z Huang, E Wu - arXiv preprint arXiv:2210.03851, 2022 - arxiv.org
Data analytics over normalized databases typically requires computing and materializing
expensive joins (wide-tables). Factorized query execution models execution as message …

Scalable distributed data cube computation for large-scale multidimensional data analysis on a Spark cluster

S Lee, S Kang, J Kim, EJ Yu - Cluster Computing, 2019 - Springer
A data cube is a powerful analytical tool that stores all aggregate values over a set of
dimensions. It provides users with a simple and efficient means of performing complex data …