SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets

G Kovács - Applied Soft Computing, 2019 - Elsevier
Learning and mining from imbalanced datasets gained increased interest in recent years.
One simple but efficient way to increase the performance of standard machine learning …

RWO-Sampling: A random walk over-sampling approach to imbalanced data classification

H Zhang, M Li - Information Fusion, 2014 - Elsevier
This study investigates how to alleviate the class imbalance problems for constructing
unbiased classifiers when instances in one class are more than that in another. Since …

An oversampling method for class imbalance problems on large datasets

F Rodríguez-Torres, JF Martínez-Trinidad… - Applied Sciences, 2022 - mdpi.com
Several oversampling methods have been proposed for solving the class imbalance
problem. However, most of them require searching the k-nearest neighbors to generate …

Metode data mining untuk seleksi calon mahasiswa pada penerimaan mahasiswa baru di Universitas Pamulang

A Saifudin - Jurnal Teknologi, 2018 - jurnal.umj.ac.id
Universitas Pamulang berusaha memberikan pendidikan tinggi dengan biaya yang
terjangkau oleh kalangan bawah. Tetapi mahasiswanya banyak yang keluar di tiap …

Synthesizing labeled data to enhance soft sensor performance in data-scarce regions

Y Lyu, J Chen, Z Song - Control Engineering Practice, 2021 - Elsevier
Quality variables are key indicators of the operating performance in industrial processes.
Because they are difficult to measure, soft sensor models can be adopted to predict them …

Pendekatan Level Data untuk Menangani Ketidakseimbangan Kelas pada Prediksi Cacat Software

A Saifudin, RS Wahono - IlmuKomputer. com Journal of Software …, 2015 - neliti.com
Dataset software metrics secara umum bersifat tidak seimbang, hal ini dapat menurunkan
kinerja model prediksi cacat software karena cenderung menghasilkan prediksi kelas …

A Data Enhancement Algorithm for DDoS Attacks Using IoT

H Lv, Y Du, X Zhou, W Ni, X Ma - Sensors, 2023 - mdpi.com
With the rapid development of the Internet of Things (IoT), the frequency of attackers using
botnets to control IoT devices in order to perform distributed denial-of-service attacks (DDoS) …

A synthetic minority based on probabilistic distribution (SyMProD) oversampling for imbalanced datasets

I Kunakorntum, W Hinthong, P Phunchongharn - IEEE Access, 2020 - ieeexplore.ieee.org
Handling an imbalanced class problem is a challenging task in real-world applications. This
problem affects various prediction models that predict only the majority class and fail to …

Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets

DC Li, SC Hu, LS Lin, CW Yeh - PloS one, 2017 - journals.plos.org
It is difficult for learning models to achieve high classification performances with imbalanced
data sets, because with imbalanced data sets, when one of the classes is much larger than …