[HTML][HTML] A survey on addressing high-class imbalance in big data

JL Leevy, TM Khoshgoftaar, RA Bauder, N Seliya - Journal of Big Data, 2018 - Springer
In a majority–minority classification problem, class imbalance in the dataset (s) can
dramatically skew the performance of classifiers, introducing a prediction bias for the …

When Gaussian process meets big data: A review of scalable GPs

H Liu, YS Ong, X Shen, J Cai - IEEE transactions on neural …, 2020 - ieeexplore.ieee.org
The vast quantity of information brought by big data as well as the evolving computer
hardware encourages success stories in the machine learning community. In the …

SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

[HTML][HTML] A survey and analysis of intrusion detection models based on cse-cic-ids2018 big data

JL Leevy, TM Khoshgoftaar - Journal of Big Data, 2020 - Springer
The exponential growth in computer networks and network applications worldwide has been
matched by a surge in cyberattacks. For this reason, datasets such as CSE-CIC-IDS2018 …

[HTML][HTML] Big Data technologies: A survey

A Oussous, FZ Benjelloun, AA Lahcen… - Journal of King Saud …, 2018 - Elsevier
Abstract Developing Big Data applications has become increasingly important in the last few
years. In fact, several organizations from different sectors depend increasingly on …

Learning from class-imbalanced data: Review of methods and applications

G Haixiang, L Yijing, J Shang, G Mingyun… - Expert systems with …, 2017 - Elsevier
Rare events, especially those that could potentially negatively impact society, often require
humans' decision-making responses. Detecting rare events can be viewed as a prediction …

[HTML][HTML] Challenges to use machine learning in agricultural big data: a systematic literature review

A Cravero, S Pardo, S Sepúlveda, L Muñoz - Agronomy, 2022 - mdpi.com
Agricultural Big Data is a set of technologies that allows responding to the challenges of the
new data era. In conjunction with machine learning, farmers can use data to address …

[HTML][HTML] Learning from imbalanced data: open challenges and future directions

B Krawczyk - Progress in artificial intelligence, 2016 - Springer
Despite more than two decades of continuous development learning from imbalanced data
is still a focus of intense research. Starting as a problem of skewed distributions of binary …

A survey of data partitioning and sampling methods to support big data analysis

MS Mahmud, JZ Huang, S Salloum… - Big Data Mining and …, 2020 - ieeexplore.ieee.org
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …

[HTML][HTML] Big data preprocessing: methods and prospects

S García, S Ramírez-Gallego, J Luengo, JM Benítez… - Big data analytics, 2016 - Springer
The massive growth in the scale of data has been observed in recent years being a key
factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety …