A comprehensive survey of anomaly detection techniques for high dimensional big data

S Thudumu, P Branch, J Jin, J Singh - Journal of Big Data, 2020 - Springer
Anomaly detection in high dimensional data is becoming a fundamental research problem
that has various applications in the real world. However, many existing anomaly detection …

Learning under concept drift: A review

J Lu, A Liu, F Dong, F Gu, J Gama… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Concept drift describes unforeseeable changes in the underlying distribution of streaming
data overtime. Concept drift research involves the development of methodologies and …

Ensemble learning: A survey

O Sagi, L Rokach - Wiley interdisciplinary reviews: data mining …, 2018 - Wiley Online Library
Ensemble methods are considered the state‐of‐the art solution for many machine learning
challenges. Such methods improve the predictive performance of a single model by training …

A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

G Aguiar, B Krawczyk, A Cano - Machine learning, 2024 - Springer
Class imbalance poses new challenges when it comes to classifying data streams. Many
algorithms recently proposed in the literature tackle this problem using a variety of data …

Ensemble learning for data stream analysis: A survey

B Krawczyk, LL Minku, J Gama, J Stefanowski… - Information …, 2017 - Elsevier
In many applications of information systems learning algorithms have to act in dynamic
environments where data are collected in the form of transient data streams. Compared to …

Adaptive random forests for evolving data stream classification

HM Gomes, A Bifet, J Read, JP Barddal, F Enembreck… - Machine Learning, 2017 - Springer
Random forests is currently one of the most used machine learning algorithms in the non-
streaming (batch) setting. This preference is attributable to its high learning performance and …

A survey on ensemble learning for data stream classification

HM Gomes, JP Barddal, F Enembreck… - ACM Computing Surveys …, 2017 - dl.acm.org
Ensemble-based methods are among the most widely used techniques for data stream
classification. Their popularity is attributable to their good performance in comparison to …

Online incremental machine learning platform for big data-driven smart traffic management

D Nallaperuma, R Nawaratne… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
The technological landscape of intelligent transport systems (ITS) has been radically
transformed by the emergence of the big data streams generated by the Internet of Things …

Learning in nonstationary environments: A survey

G Ditzler, M Roveri, C Alippi… - IEEE Computational …, 2015 - ieeexplore.ieee.org
The prevalence of mobile phones, the internet-of-things technology, and networks of
sensors has led to an enormous and ever increasing amount of data that are now more …

A survey on concept drift adaptation

J Gama, I Žliobaitė, A Bifet, M Pechenizkiy… - ACM computing …, 2014 - dl.acm.org
Concept drift primarily refers to an online supervised learning scenario when the relation
between the input data and the target variable changes over time. Assuming a general …