[HTML][HTML] A classification framework for straggler mitigation and management in a heterogeneous Hadoop cluster: A state-of-art survey

KL Bawankule, RK Dewang, AK Singh - Journal of King Saud University …, 2022 - Elsevier
Hadoop is the most economical and cheap software framework that allows distributed
storage and parallel processing of more extensive data sets. Hadoop distributed file system …

Historical data based approach to mitigate stragglers from the Reduce phase of MapReduce in a heterogeneous Hadoop cluster

KL Bawankule, RK Dewang, AK Singh - Cluster Computing, 2022 - Springer
Hadoop MapReduce processes data on the cluster of commodity hardware (node) in two
phases using Map and Reduce tasks. Yet another resource negotiator (YARN), a dynamic …

Historical data based approach for straggler avoidance in a heterogeneous Hadoop cluster

KL Bawankule, RK Dewang, AK Singh - Journal of Ambient Intelligence …, 2021 - Springer
Cloud computing has emerged as a new way of sharing resources. MapReduce has
become the de facto standard for cloud computing, which helps for data-intensive …

Intelligent data compression policy for Hadoop performance optimization

A Ashu, MW Hussain, D Sinha Roy… - Proceedings of the 11th …, 2021 - Springer
Hadoop can deal with Zeta-level data, but the huge request for Disk I/O and Network
utilization often appears as the limitations in Hadoop. During different job execution phases …

A counter based approach for reducer placement with augmented Hadoop rackawareness

MW Hussain, KH REDDY… - Turkish Journal of …, 2021 - journals.tubitak.gov.tr
As the data-driven paradigm for intelligent systems design is gaining prominence,
performance requirements have become very stringent, leading to numerous fine-tuned …

A counter-based profiling scheme for improving locality through data and reducer placement

MW Hussain, DS Roy - Advances in Machine Learning for Big Data …, 2022 - Springer
Hadoop has been regarded as the de-facto standard for handling data-intensive distributed
applications with its popular storage and processing engine called as the Hadoop …

Enabling indirect link discovery between SDN switches

MW Hussain, D Sinha Roy - … of the International Conference on Computing …, 2021 - Springer
The removal of the control plane from a Software Defined Network (SDN) helps avoid
flexibility issues that exist in the traditional networks thus enabling SDN to leverage more …

Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques

MR Sundara Kumar, HS Mohan - Journal of Intelligent & …, 2024 - content.iospress.com
Abstract Big Data Analytics (BDA) is an unavoidable technique in today's digital world for
dealing with massive amounts of digital data generated by online and internet sources. It is …

A machine learning approach for daily temperature prediction using big data

U Divakarla, K Chandrasekaran, KHK Reddy… - Inventive Systems and …, 2022 - Springer
Due to global warming, weather forecasting becomes complex problem which is affected by
a lot of factors like temperature, wind speed, humidity, year, month, day, etc. weather …

Effective Resource Utilization in Heterogeneous Hadoop Environment Through a Dynamic Inter-cluster and Intra-cluster Load Balancing

E Hosni, W Chaari, N Kolsi, K Ghedira - Asian Conference on Intelligent …, 2022 - Springer
Apache Hadoop is one of the most popular distributed computing systems, used largely for
big data analysis and processing. The Hadoop cluster hosts multiple parallel workloads …