Hybridtune: spatio-temporal performance data correlation for performance diagnosis of big data systems

R Ren, J Cheng, XW He, L Wang, JF Zhan… - Journal of Computer …, 2019 - Springer
With tremendous growing interests in Big Data, the performance improvement of Big Data
systems becomes more and more important. Among many steps, the first one is to analyze …

Generalizing streaming pipeline design for big data

K Rengarajan, VK Menon - Machine Intelligence and Signal Processing …, 2020 - Springer
Streaming data refers to the data that is sent to a cloud or a processing centre in real time.
Even though we have limited exposure to such applications that can process data streams …

Harmonizing Dimensionality: Unveiling the Prowess of Variational Auto-Encoder in Spark for Big Data Processing.

W Jawad, A Al-Bakry - Revue d'Intelligence Artificielle, 2024 - search.ebscohost.com
In the dynamic realm of big data processing, conquering the challenges imposed by
highdimensional datasets is imperative. This paper introduces a groundbreaking …

Towards a big data benchmarking and demonstration suite for the online social network era with realistic workloads and live data

R Zhang, I Manotas, M Li, D Hildebrand - … HI, USA, August 31-September 4 …, 2016 - Springer
The growing popularity of online social networks has taken big data analytics into uncharted
territories. Newly developed platforms and analytics in these environments are in dire need …

Does Big Data Require Complex Systems? A Performance Comparison Between Spark and Unicage Shell Scripts

DM Nascimento, M Ferreira, ML Pardal - arXiv preprint arXiv:2212.13647, 2022 - arxiv.org
The paradigm of big data is characterized by the need to collect and process data sets of
great volume, arriving at the systems with great velocity, in a variety of formats. Spark is a …

Benchmarking graph data management and processing systems: A survey

M Dayarathna, T Suzumura - arXiv preprint arXiv:2005.12873, 2020 - arxiv.org
The development of scalable, representative, and widely adopted benchmarks for graph
data systems have been a question for which answers has been sought for decades. We …

Big data analysis in financial markets

TJ Green - 2019 - search.proquest.com
This dissertation researched topics in the financial analysis of publicly traded companies.
The opportunity for this dissertation was to address the potential of increasing value for …

[PDF][PDF] Big data dwarfs: Towards fully understanding big data analytics workloads

W Gao, L Wang, J Zhan, C Luo, D Zheng… - arXiv preprint arXiv …, 2018 - academia.edu
Though the big data benchmark suites like BigDataBench and CloudSuite have been used
in architecture and system researches, we have not yet answered the fundamental issue …

An ontology-based conceptual modeling method for data warehouse

L He, Y Chen, N Meng, LY Liu - 2011 International Conference …, 2011 - ieeexplore.ieee.org
Accurate conceptual model is the foundation of data warehouse building. Studies show that
one of the main reasons why a data warehouse project ends up failure is that the modeling …

Towards a set of metrics to guide the generation of fake computer file systems

B Whitham - 2014 - ro.ecu.edu.au
Fake file systems are used in the field of cyber deception to bait intruders and fool forensic
investigators. File system researchers also frequently generate their own synthetic document …