Benchmarking dependability of MapReduce systems

A Sangroya, D Serrano… - 2012 IEEE 31st …, 2012 - ieeexplore.ieee.org
MapReduce is a popular programming model for distributed data processing. Extensive
research has been conducted on the reliability of MapReduce, ranging from adaptive and on …

Availability of JobTracker machine in Hadoop/MapReduce ZooKeeper coordinated clusters

E Okorafor, MK Patrick - Advanced Computing, 2012 - Citeseer
It is difficult to use the traditional Message Passing Interface (MPI) approach to implement
synchronization, coordination, and prevent deadlocks in distributed systems. This difficulty is …

MRBS: towards dependability benchmarking for Hadoop MapReduce

A Sangroya, D Serrano, S Bouchenak - Euro-Par 2012: Parallel …, 2013 - Springer
MapReduce is a popular programming model for distributed data processing. Extensive
research has been conducted on the reliability of MapReduce, ranging from adaptive and on …

Experience with benchmarking dependability and performance of MapReduce systems

A Sangroya, S Bouchenak, D Serrano - Performance Evaluation, 2016 - Elsevier
MapReduce provides a convenient means for distributed data processing and automatic
parallel execution on clusters of machines. It has various applications and is used by several …

Open reading frame phylogenetic analysis on the cloud

CL Hung, CY Lin - International Journal of Genomics, 2013 - Wiley Online Library
Phylogenetic analysis has become essential in researching the evolutionary relationships
between viruses. These relationships are depicted on phylogenetic trees, in which viruses …

An Enhanced Distributed Algorithm for Area Skyline Computation Based on Apache Spark

C Li, Y Cao, Y Zhu, J Zhang, A Annisa, D Cheng… - … on Knowledge Science …, 2023 - Springer
Skyline computations are a way of finding the best data points based on multiple criteria for
location-based decision-making. However, as the input data grows larger, these …

A fault-tolerant environment for large-scale query processing

MC Kurt, G Agrawal - 2012 19th International Conference on …, 2012 - ieeexplore.ieee.org
As datasets are increasing in size, the data management and processing needs are being
met with added parallelism, i.e., by involving more nodes and/or cores in the system. This, in …

HBase fine grained access control with extended permissions and inheritable roles

Y Lai, Q Qian - 2015 IEEE/ACIS 16th International Conference …, 2015 - ieeexplore.ieee.org
HBase is a widely used distributed and column-oriented database based on Hadoop and
HDFS. But it still has some shortcomings in storing and sharing data. In order to upgrade the …

Mining Area Skyline Objects from Map-based Big Data using Apache Spark Framework

C Li, Y Zhu, Y Cao, J Zhang, A Annisa, D Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
The computation of the skyline provides a mechanism for utilizing multiple location-based
criteria to identify optimal data points. However, the efficiency of these computations …

Learner's satisfaction within a breast imaging eLearning course for radiographers

IC Moreira, SR Ventura, I Ramos… - Proceedings of the …, 2013 - ieeexplore.ieee.org
Background: An asynchronous eLearning system was developed for radiographers in order
to promote a better knowledge about senology and mammography. Objectives: to assess …