Testing MapReduce programs: A systematic mapping study

J Morán, C de la Riva, J Tuya - Journal of Software: Evolution …, 2019 - Wiley Online Library
Context: MapReduce is a processing model used in Big Data to facilitate the analysis of large
data under a distributed architecture. Objective: The aim of this study is to identify and …

Big Data testing techniques: taxonomy, challenges and future trends

I Arshad, SH Alsamhi, W Afzal - arXiv preprint arXiv:2111.02853, 2021 - arxiv.org
Big Data is reforming many industrial domains by providing decision support through
analyzing large data volumes. Big Data testing aims to ensure that Big Data systems run …

A temporal based approach for MapReduce distributed testing

S Hsaini, S Azzouzi, MEH Charaf - International Journal of Parallel …, 2021 - Taylor & Francis
Over the last few years, there has been a rising trend towards the field of distributed testing
where the implementation under test (IUT) has physical distributed ports. However, running …

Automatic testing of design faults in MapReduce applications

J Morán, A Bertolino, C de la Riva… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
New processing models are being adopted in Big Data engineering to overcome the
limitations of traditional technology. Among them, MapReduce stands out by allowing for the …

Testing data transformations in MapReduce programs

J Morán, C Riva, J Tuya - Proceedings of the 6th International Workshop …, 2015 - dl.acm.org
MapReduce is a parallel data processing paradigm oriented to process large volumes of
information in data-intensive applications, such as Big Data environments. A characteristic of …

Towards ex vivo testing of MapReduce applications

J Morán, A Bertolino, C de la Riva… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Big Data programs are those that process large data exceeding the capabilities of traditional
technologies. Among newly proposed processing models, MapReduce stands out as it …

Automatic Debugging of Design Faults in MapReduce Applications

J Morán, A Bertolino, C de la Riva… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Among the current technologies to analyse large data, the MapReduce processing model
stands out in Big Data. MapReduce is implemented in frameworks such as Hadoop, Spark …

TRANSMUT‐Spark: Transformation mutation for Apache Spark

JB de Souza Neto, A Martins Moreira… - Software Testing …, 2022 - Wiley Online Library
This paper proposes TRANSMUT‐Spark for automating mutation testing of big data processing
code within Spark programs. Apache Spark is an engine for big data analytics/processing …

Infrastructure-aware functional testing of MapReduce programs

J Morán, B Rivas, C De La Riva, J Tuya… - 2016 IEEE 4th …, 2016 - ieeexplore.ieee.org
Programs that process a large volume of data generally run in a distributed and parallel
architecture, such as the programs implemented in the processing model MapReduce. In …

TRANSMUT-SPARK: Transformation Mutation for Apache Spark

JBS Neto, AM Moreira, G Vargas-Solar… - arXiv preprint arXiv …, 2021 - arxiv.org
We propose TRANSMUT-Spark, a tool that automates the mutation testing process of Big
Data processing code within Spark programs. Apache Spark is an engine for Big Data …