SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

On the joint-effect of class imbalance and overlap: a critical review

MS Santos, PH Abreu, N Japkowicz… - Artificial Intelligence …, 2022 - Springer
Current research on imbalanced data recognises that class imbalance is aggravated by
other data intrinsic characteristics, among which class overlap stands out as one of the most …

On the class overlap problem in imbalanced data classification

P Vuttipittayamongkol, E Elyan, A Petrovski - Knowledge-based systems, 2021 - Elsevier
Class imbalance is an active research area in the machine learning community. However,
existing and recent literature showed that class overlap had a higher negative impact on the …

Review of classification methods on unbalanced data sets

L Wang, M Han, X Li, N Zhang, H Cheng - Ieee Access, 2021 - ieeexplore.ieee.org
This paper studies the classification of unbalanced data sets. First, this kind of data sets is
briefly introduced, and then the classification methods of unbalanced data sets are analyzed …

Enhanced credit card fraud detection model using machine learning

NS Alfaiz, SM Fati - Electronics, 2022 - mdpi.com
The COVID-19 pandemic has limited people's mobility to a certain extent, making it difficult
to purchase goods and services offline, which has led the creation of a culture of increased …

A hybrid method with dynamic weighted entropy for handling the problem of class imbalance with overlap in credit card fraud detection

Z Li, M Huang, G Liu, C Jiang - Expert Systems with Applications, 2021 - Elsevier
Class imbalance with overlap is a very challenging problem in electronic fraud transaction
detection. Fraudsters have racked their brains to make a fraud transaction as similar as a …

Neighbourhood-based undersampling approach for handling imbalanced and overlapped data

P Vuttipittayamongkol, E Elyan - Information Sciences, 2020 - Elsevier
Class imbalanced datasets are common across different domains including health, security,
banking and others. A typical supervised learning algorithm tends to be biased towards the …

Prediction model of rock mass class using classification and regression tree integrated AdaBoost algorithm based on TBM driving data

Q Liu, X Wang, X Huang, X Yin - Tunnelling and Underground Space …, 2020 - Elsevier
The real-time acquisition of surrounding rock information is important for the efficient
tunneling and hazard prevention in tunnel boring machines (TBMs). This study presents an …

An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics

V López, A Fernández, S García, V Palade… - Information sciences, 2013 - Elsevier
Training classifiers with datasets which suffer of imbalanced class distributions is an
important problem in data mining. This issue occurs when the number of examples …

[HTML][HTML] RN-SMOTE: Reduced Noise SMOTE based on DBSCAN for enhancing imbalanced data classification

A Arafa, N El-Fishawy, M Badawy, M Radad - Journal of King Saud …, 2022 - Elsevier
Abstract Machine learning classifiers perform well on balanced datasets. Unfortunately, a lot
of the real-world data sets are naturally imbalanced. So, imbalanced classification is a …