A survey on networked data streaming with apache kafka

TP Raptis, A Passarella - IEEE access, 2023 - ieeexplore.ieee.org
Apache Kafka has become a popular solution for managing networked data streaming in a
variety of applications, from industrial to general purpose. This paper systematically surveys …

Big data stream processing

OC Marcu, P Bouvry - 2024 - hal.science
This chapter provides students, industry experts, and researchers a high-level and
comprehensive overview of the end-to-end architectures of big data stream processing …

[HTML][HTML] Kafka-ML: Connecting the data stream with ML/AI frameworks

C Martín, P Langendoerfer, PS Zarrin, M Díaz… - Future Generation …, 2022 - Elsevier
Abstract Machine Learning (ML) and Artificial Intelligence (AI) depend on data sources to
train, improve, and make predictions through their algorithms. With the digital revolution and …

Challenges and solutions for processing real-time big data stream: a systematic literature review

E Mehmood, T Anees - IEEE Access, 2020 - ieeexplore.ieee.org
Contribution: Recently, real-time data warehousing (DWH) and big data streaming have
become ubiquitous due to the fact that a number of business organizations are gearing up to …

A survey on classifying big data with label noise

JM Johnson, TM Khoshgoftaar - ACM Journal of Data and Information …, 2022 - dl.acm.org
Class label noise is a critical component of data quality that directly inhibits the predictive
performance of machine learning algorithms. While many data-level and algorithm-level …

[PDF][PDF] Machine Learning Based Approach to Anomaly and Cyberattack Detection in Streamed Network Traffic Data.

M Komisarek, M Pawlicki, R Kozik… - J. Wirel. Mob. Networks …, 2021 - isyou.info
In this paper, the performance of a solution providing stream processing is evaluated, and its
accuracy in the classification of suspicious flows in simulated network traffic is investigated …

A new Apache Spark-based framework for big data streaming forecasting in IoT networks

AM Fernández-Gómez, D Gutiérrez-Avilés… - The Journal of …, 2023 - Springer
Analyzing time-dependent data acquired in a continuous flow is a major challenge for
various fields, such as big data and machine learning. Being able to analyze a large volume …

[HTML][HTML] AIDA—A holistic AI-driven networking and processing framework for industrial IoT applications

H Chahed, M Usman, A Chatterjee, F Bayram… - Internet of Things, 2023 - Elsevier
Industry 4.0 is characterized by digitalized production facilities, where a large volume of
sensors collect a vast amount of data that is used to increase the sustainability of the …

A systematic mapping of performance in distributed stream processing systems

A Vogel, S Henning, O Ertl… - 2023 49th Euromicro …, 2023 - ieeexplore.ieee.org
Several software systems are built upon stream processing architectures to process large
amounts of data in near real-time. Today's distributed stream processing systems (DSPSs) …

Application-aware resource allocation and data management for MEC-assisted IoT service providers

S Bolettieri, R Bruno, E Mingozzi - Journal of Network and Computer …, 2021 - Elsevier
To support the growing demand for data-intensive and low-latency IoT applications, Multi-
Access Edge Computing (MEC) is emerging as an effective edge-computing approach …