A comparison of undersampling, oversampling, and SMOTE methods for dealing with imbalanced classification in educational data mining

T Wongvorachan, S He, O Bulut - Information, 2023 - mdpi.com
Educational data mining is capable of producing useful data-driven applications (eg, early
warning systems in schools or the prediction of students' academic achievement) based on …

Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning

H Han, WY Wang, BH Mao - International conference on intelligent …, 2005 - Springer
In recent years, mining with imbalanced data sets receives more and more attentions in both
theoretical and practical aspects. This paper introduces the importance of imbalanced data …

An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets

G Kovács - Applied Soft Computing, 2019 - Elsevier
Learning and mining from imbalanced datasets gained increased interest in recent years.
One simple but efficient way to increase the performance of standard machine learning …

A new over-sampling approach: random-SMOTE for learning from imbalanced data sets

Y Dong, X Wang - … and Management: 5th International Conference, KSEM …, 2011 - Springer
For imbalanced data sets, examples of minority class are sparsely distributed in sample
space compared with the overwhelming amount of majority class. This presents a great …

Two density-based sampling approaches for imbalanced and overlapping data

S Mayabadi, H Saadatfar - Knowledge-Based Systems, 2022 - Elsevier
An imbalanced dataset consists of a majority class and a minority class, where the former's
sample size is substantially larger than other classes. This difference disrupts the data …

Selective oversampling approach for strongly imbalanced data

P Gnip, L Vokorokos, P Drotár - PeerJ Computer Science, 2021 - peerj.com
Challenges posed by imbalanced data are encountered in many real-world applications.
One of the possible approaches to improve the classifier performance on imbalanced data is …

An improved and random synthetic minority oversampling technique for imbalanced data

G Wei, W Mu, Y Song, J Dou - Knowledge-based systems, 2022 - Elsevier
Imbalanced data learning has become a major challenge in data mining and machine
learning. Oversampling is an effective way to re-achieve the balance by generating new …

Over-sampling algorithm for imbalanced data classification

XU Xiaolong, C Wen, SUN Yanfei - Journal of Systems …, 2019 - ieeexplore.ieee.org
For imbalanced datasets, the focus of classification is to identify samples of the minority
class. The performance of current data mining algorithms is not good enough for processing …

Entropy-based sampling approaches for multi-class imbalanced problems

L Li, H He, J Li - IEEE Transactions on Knowledge and Data …, 2019 - ieeexplore.ieee.org
In data mining, large differences between multi-class distributions regarded as class
imbalance issues have been known to hinder the classification performance. Unfortunately …

Learning imbalanced datasets based on SMOTE and Gaussian distribution

T Pan, J Zhao, W Wu, J Yang - Information Sciences, 2020 - Elsevier
The learning of imbalanced datasets is a ubiquitous challenge for researchers in the fields of
data mining and machine learning. Conventional classifiers are often biased towards the …