A comprehensive study and review of tuning the performance on database scalability in big data analytics

MR Sundarakumar, G Mahadevan… - Journal of Intelligent …, 2023 - content.iospress.com
In the modern era, digital data processing with a huge volume of data from the repository is
challenging due to various data formats and the extraction techniques available. The …

[HTML][HTML] A classification framework for straggler mitigation and management in a heterogeneous Hadoop cluster: A state-of-art survey

KL Bawankule, RK Dewang, AK Singh - Journal of King Saud University …, 2022 - Elsevier
Hadoop is the most economical and cheap software framework that allows distributed
storage and parallel processing of more extensive data sets. Hadoop distributed file system …

Heterogeneous architectures for big data batch processing in mapreduce paradigm

M Goudarzi - IEEE Transactions on Big Data, 2017 - ieeexplore.ieee.org
The amount of digital data produced worldwide is exponentially growing. While the source of
this data, collectively known as Big Data, varies from among mobile services to cyber …

Using distributed data over HBase in big data analytics platform for clinical services

D Chrimes, H Zamani - Computational and mathematical …, 2017 - Wiley Online Library
Big data analytics (BDA) is important to reduce healthcare costs. However, there are many
challenges of data aggregation, maintenance, integration, translation, analysis, and …

{MinFlow}: High-performance and Cost-efficient Data Passing for {I/O-intensive} Stateful Serverless Analytics

T Li, Y Li, W Zhu, Y Xu, JCS Lui - 22nd USENIX Conference on File and …, 2024 - usenix.org
Serverless computing has revolutionized application deployment, obviating traditional
infrastructure management and dynamically allocating resources on demand. A significant …

A counter based approach for reducer placement with augmented Hadoop rackawareness

MW Hussain, KH REDDY… - Turkish Journal of …, 2021 - journals.tubitak.gov.tr
As the data-driven paradigm for intelligent systems design is gaining prominence,
performance requirements have become very stringent, leading to numerous fine-tuned …

A survey of big data machine learning applications optimization in cloud data centers and networks

SH Mohamed, TEH El-Gorashi… - arXiv preprint arXiv …, 2019 - arxiv.org
This survey article reviews the challenges associated with deploying and optimizing big data
applications and machine learning algorithms in cloud data centers and networks. The …

A counter-based profiling scheme for improving locality through data and reducer placement

MW Hussain, DS Roy - Advances in Machine Learning for Big Data …, 2022 - Springer
Hadoop has been regarded as the de-facto standard for handling data-intensive distributed
applications with its popular storage and processing engine called as the Hadoop …

An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce

MR Sundarakumar, G Mahadevan… - International Journal of …, 2021 - igi-global.com
Abstract Big Data Analytics is an innovative approach for extracting the data from a huge
volume of data warehouse systems. It reveals the method to compress the high volume of …

Faster mapreduce computation on clouds through better performance estimation

SM Nabavinejad, M Goudarzi - IEEE Transactions on Cloud …, 2017 - ieeexplore.ieee.org
Processing Big Data in cloud is on the increase. An important issue for efficient execution of
Big Data processing jobs on a cloud platform is selecting the best fitting virtual machine (VM) …