PRBP: A prioritized replica balancing policy for HDFS balancer

RWA Fazul, PP Barcelos - Software: Practice and Experience, 2023 - Wiley Online Library
Data replication is the main fault tolerance mechanism implemented by the Apache Hadoop
Distributed File System (HDFS). The placement of the data across the cluster directly affects …

[PDF][PDF] An efficient approach for bigdata security based on Hadoop system using cryptographic techniques

S Gattoju, V Nagalakshmi - Indian Journal of Computer Science and …, 2021 - ijcse.com
When relational database systems could no longer keep up with the huge amounts of
unstructured data created by organizations, social media, and all other data-generating …

Faulty Node Detection in HDFS Using Machine Learning Techniques.

RS Gaykar, V Khanaa, SD Joshi - Revue d'Intelligence …, 2022 - search.ebscohost.com
The design of Hadoop has ability to elimination of fault tolerance, which consists of
rescheduling the task on the defective nodes to run on other devices in the system …

Automation and prioritization of replica balancing in HDFS

RWA Fazul, PP Barcelos - Proceedings of the 36th Annual ACM …, 2021 - dl.acm.org
The Hadoop Distributed File System (HDFS) is a reliable storage engine designed to run
over commodity hardware. To provide reliability and read performance, HDFS has a storage …

An event-driven strategy for reactive replica balancing on apache hadoop distributed file system

RWA Fazul, PP Barcelos - Proceedings of the 37th ACM/SIGAPP …, 2022 - dl.acm.org
Distributed file systems are essential to support applications that handle large volumes of
data. One of the most widely used distributed file systems is the HDFS, the Apache Hadoop's …

[PDF][PDF] Data recovery approach with optimized Cauchy coding in distributed storage system

S Funde, G Swain - … Journal of Advanced Computer Science and …, 2022 - academia.edu
In the professional world, the impact of big data is pulsating to change things. Data is
currently generated by a wide range of sensors that are part of smart devices. It necessitates …

Estimación del rendimiento de arquitectura homogénea y/o heterogénea para big data

CIH Bravo, AAV Ramirez… - Revista de Investigación …, 2023 - revistas.utea.edu.pe
La cuarta revolución industrial interactúa con otras vertientes como Cloud Computing,
Internet de las Cosas, Ciencia de Datos, Ingeniero de datos, Inteligencia Artificial con …

Load balancing algorithms with cluster in cloud environment

SB Kshama, KR Shobha - International Journal of …, 2022 - inderscienceonline.com
Load balancing is one of the important aspects of cloud computing. Its main goal is to
improve system performance and to reduce its cost. Cloud computing has a dedicated load …

A Framework for Big Data Security Using MapReduce in IoT Enabled Computing

K Kala, K Makhloga, A Khan… - 2024 2nd International …, 2024 - ieeexplore.ieee.org
With the increase in number of IoT devices day by day, a large amount of unstructured,
structured and semi-structured data is being generated, collectively termed as big data. The …

A HDFS dynamic load balancing strategy using improved niche PSO algorithm in cloud storage

Z Jian, Y Jian - International Journal of Autonomous and …, 2021 - inderscienceonline.com
A Hadoop distributed file system (HDFS) NameNode dynamic load balancing strategy
(NDLBT) using improved niche particle swarm optimisation (PSO) is proposed, which is …