[HTML][HTML] MapReduce scheduling algorithms in Hadoop: a systematic study

S Hedayati, N Maleki, T Olsson, F Ahlgren… - Journal of Cloud …, 2023 - Springer
Hadoop is a framework for storing and processing huge volumes of data on clusters. It uses
Hadoop Distributed File System (HDFS) for storing data and uses MapReduce to process …

On the big data processing algorithms for finding frequent sequences

AB Can, M Zaval, M Uzun‐Per… - … Practice and Experience, 2023 - Wiley Online Library
Sequential pattern mining algorithms extract trendy sequence appearances inside ordered
transactional datasets such as market basket datasets. There is a lack of research …

[HTML][HTML] Online task scheduling of big data applications in the cloud environment

L Bouhouch, M Zbakh, C Tadonki - Information, 2023 - mdpi.com
The development of big data has generated data-intensive tasks that are usually time-
consuming, with a high demand on cloud data centers for hosting big data applications. It …

Analysis of Large SARS-CoV-2 Data using Scalable Genetic Algorithm with Enhanced Bi-LSTM Method

U Singh, A Raundale - International Journal of Intelligent Systems and …, 2023 - ijisae.org
Abstract Corona Virus Disease 2019 (COVID-19), caused by the Severe Acute Respiratory
Syndrome Coronavirus-2 (SARS-CoV-2) virus, which emerged in late 2019, is now …

Apache Spark-based scalable feature extraction approaches for protein sequence and their clustering performance analysis

P Jha, A Tiwari, N Bharill, M Ratnaparkhe… - International Journal of …, 2023 - Springer
Genome sequencing projects are rapidly contributing to the rise of high-dimensional protein
sequence datasets. Extracting features from a high-dimensional protein sequence dataset …

[PDF][PDF] Online Task Scheduling of Big Data Applications in the Cloud Environment. Information 2023, 14, 292

L Bouhouch, M Zbakh, C Tadonki - 2023 - academia.edu
The development of big data has generated data-intensive tasks that are usually
timeconsuming, with a high demand on cloud data centers for hosting big data applications …

Scalable and robust big data clustering with adaptive local feature weighting based on the Map-Reduce and Hadoop

M Mohammadi, A Shokrollahi, M Reisi… - 2023 - researchsquare.com
Fuzzy c-means (FCM) is an effective clustering algorithm, which has been successfully
applied on many real-world applications. Although, FCM and its improvements have …

Characteristics of Dust Aerosol Properties Using CALIOP and Thermal Infrared Satellite Observations

J Zheng - 2023 - search.proquest.com
Mineral dust aerosol transport in the atmosphere impacts the radiation budget of Earth,
cloud formations, ocean and terrestrial biogeochemical processes, visibility and human …

[PDF][PDF] Fully Dynamic Maximal Independent Set in Polylogarithmic Update Time

B Krekelberg - pure.tue.nl
Abstract The Maximal Independent Set (MIS) problem is a fundamental problem in
theoretical computer science and combinatorial optimization, aiming to find a set of mutually …