A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

G Aguiar, B Krawczyk, A Cano - Machine learning, 2023 - Springer
Class imbalance poses new challenges when it comes to classifying data streams. Many
algorithms recently proposed in the literature tackle this problem using a variety of data …

Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review

AS Palli, J Jaafar, AR Gilal… - … of Information and …, 2024 - e-journal.uum.edu.my
In IoT environment applications generate continuous non-stationary data streams with in-
built problems of concept drift and class imbalance which cause classifier performance …

The L2 convergence of stream data mining algorithms based on probabilistic neural networks

D Rutkowska, P Duda, J Cao, L Rutkowski, A Byrski… - Information …, 2023 - Elsevier
This paper concerns a new incremental approach to mining data streams. It is known that
patterns in a data stream may evolve over time. In many cases, we need to track and …

Pro-IDD: Pareto-based ensemble for imbalanced and drifting data streams

M Usman, H Chen - Knowledge-Based Systems, 2023 - Elsevier
Abstract Concept drifts and class imbalance are two primary challenges in supervised data
stream classification, whereas their co-occurrence presents a more complicated learning …

A weighted ensemble classification algorithm based on nearest neighbors for multi-label data stream

H Wu, M Han, Z Chen, M Li, X Zhang - ACM Transactions on Knowledge …, 2023 - dl.acm.org
With the rapid development of data stream, multi-label algorithms for mining dynamic data
become more and more important. At the same time, when data distribution changes …

[HTML][HTML] Hybrid sampling and dynamic weighting-based classification method for multi-class imbalanced data stream

M Han, A Li, Z Gao, D Mu, S Liu - Applied Sciences, 2023 - mdpi.com
The imbalance and concept drift problems in data streams become more complex in multi-
class environment, and extreme imbalance and variation in class ratio may also exist. To …

Probabilistic neural networks for incremental learning over time-varying streaming data with application to air pollution monitoring

D Rutkowska, P Duda, J Cao, M Jaworski… - Applied Soft …, 2024 - Elsevier
This paper proposes a novel algorithm for incremental learning over streaming data in a non-
stationary environment. The idea refers to the applicability of Probabilistic Neural Networks …

Unsupervised concept drift detection method based on robust random cut forest

Z Pang, J Cen, M Yi - International Journal of Machine Learning and …, 2023 - Springer
The prevalence of streams in practical applications is rapidly increasing, making stream data
mining increasingly important. However, unlike the static datasets used in machine learning …

Evolvability of machine learning-based systems: An architectural design decision framework

J Leest, I Gerostathopoulos… - 2023 IEEE 20th …, 2023 - ieeexplore.ieee.org
The increasing integration of machine learning (ML) in modern software systems has lead to
new challenges as a result of the shift from human-determined behavior to data-determined …

[HTML][HTML] Online evaluation of the Kolmogorov–Smirnov test on arbitrarily large samples

DO Cardoso, TD Galeno - Journal of Computational Science, 2023 - Elsevier
This paper presents an approximative online algorithm to perform the Kolmogorov–Smirnov
test. There is a ubiquitous need for evaluating the fitness between statistical distributions …