A survey of distributed data stream processing frameworks

H Isah, T Abughofa, S Mahfuz, D Ajerla… - IEEE …, 2019 - ieeexplore.ieee.org
Big data processing systems are evolving to be more stream oriented where each data
record is processed as it arrives by distributed and low-latency computational frameworks on …

Big data systems: A software engineering perspective

A Davoudian, M Liu - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Big Data Systems (BDSs) are an emerging class of scalable software technologies whereby
massive amounts of heterogeneous data are gathered from multiple sources, managed …

Complex event recognition in the big data era: a survey

N Giatrakos, E Alevizos, A Artikis, A Deligiannakis… - The VLDB Journal, 2020 - Springer
The concept of event processing is established as a generic computational paradigm in
various application fields. Events report on state changes of a system and its environment …

A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench

N Ahmed, ALC Barczak, T Susnjak, MA Rashid - Journal of Big Data, 2020 - Springer
Big Data analytics for storing, processing, and analyzing large-scale datasets has become
an essential tool for the industry. The advent of distributed computing frameworks such as …

Evaluation of stream processing frameworks

G Van Dongen, D Van den Poel - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The increasing need for real-time insights in data sparked the development of multiple
stream processing frameworks. Several benchmarking studies were conducted in an effort to …

A configurable method for benchmarking scalability of cloud-native applications

S Henning, W Hasselbring - Empirical Software Engineering, 2022 - Springer
Cloud-native applications constitute a recent trend for designing large-scale software
systems. However, even though several cloud-native tools and patterns have emerged to …

The nebulastream platform: Data and application management for the internet of things

S Zeuch, A Chaudhary, B Del Monte… - arXiv preprint arXiv …, 2019 - arxiv.org
The Internet of Things (IoT) presents a novel computing architecture for data management: a
distributed, highly dynamic, and heterogeneous environment of massive scale. Applications …

Theodolite: Scalability benchmarking of distributed stream processing engines in microservice architectures

S Henning, W Hasselbring - Big Data Research, 2021 - Elsevier
Distributed stream processing engines are designed with a focus on scalability to process
big data volumes in a continuous manner. We present the Theodolite method for …

Rhino: Efficient management of very large distributed state for stream processing engines

B Del Monte, S Zeuch, T Rabl, V Markl - Proceedings of the 2020 ACM …, 2020 - dl.acm.org
Scale-out stream processing engines (SPEs) are powering large big data applications on
high velocity data streams. Industrial setups require SPEs to sustain outages, varying data …

Dspbench: A suite of benchmark applications for distributed data stream processing systems

MV Bordin, D Griebler, G Mencagli, CFR Geyer… - IEEE …, 2020 - ieeexplore.ieee.org
Systems enabling the continuous processing of large data streams have recently attracted
the attention of the scientific community and industrial stakeholders. Data Stream Processing …