Virtual sample generation for small sample learning: a survey, recent developments and future prospects

J Wen, A Su, X Wang, H Xu, J Ma, K Chen, X Ge, Z Xu… - Neurocomputing, 2024 - Elsevier
Virtual sample generation (VSG) technology aims to generate virtual samples based on real
samples, in order to expand the size of the datasets and improve model performance …

Self-adaptive oversampling method based on the complexity of minority data in imbalanced datasets classification

X Tao, X Guo, Y Zheng, X Zhang, Z Chen - Knowledge-Based Systems, 2023 - Elsevier
Learning from imbalanced datasets is a nontrivial task for supervised learning community.
Traditional classifiers may have difficulties to learn the concept related to the minority class …

A cluster-based SMOTE both-sampling (CSBBoost) ensemble algorithm for classifying imbalanced data

AR Salehi, M Khedmati - Scientific Reports, 2024 - nature.com
In this paper, a Cluster-based Synthetic minority oversampling technique (SMOTE) Both-
sampling (CSBBoost) ensemble algorithm is proposed for classifying imbalanced data. In …

Selective multiple kernel fuzzy clustering with locality preserved ensemble

C Zhang, L Chen, YF Yu, YP Zhao, Z Shi… - Knowledge-Based …, 2024 - Elsevier
Multiple kernel fuzzy clustering (MKFC) has demonstrated promising performance in
capturing the non-linear relationships within data. However, its effectiveness relies heavily …

CTGAN-MOS: Conditional generative adversarial network based minority-class-augmented oversampling scheme for imbalanced problems

A Majeed, SO Hwang - IEEE Access, 2023 - ieeexplore.ieee.org
This paper proposes a novel data augmentation scheme called the conditional generative
adversarial network minority-class-augmented oversampling scheme (CTGAN-MOS) for …

HGDO: An oversampling technique based on hypergraph recognition and Gaussian distribution

L Jia, Z Wang, P Sun, P Wang - Information Sciences, 2024 - Elsevier
The synthetic minority oversampling technique (SMOTE) is the most prevalent solution in
class imbalance learning. While SMOTE and its variant methods handle imbalanced data …

OPT-RNN-DBSVM: OPTimal recurrent neural network and density-based support vector machine

K El Moutaouakil, A El Ouissari, A Olaru, V Palade… - Mathematics, 2023 - mdpi.com
When implementing SVMs, two major problems are encountered:(a) the number of local
minima of dual-SVM increases exponentially with the number of samples and (b) the …

Synthesizing credit data using autoencoders and generative adversarial networks

G Oreski - Knowledge-Based Systems, 2023 - Elsevier
Data quality is an essential element necessary for the development of a successful machine-
learning project. One of the biggest challenges in various real-world application domains is …

TDMO: Dynamic multi-dimensional oversampling for exploring data distribution based on extreme gradient boosting learning

L Jia, Z Wang, P Sun, Z Xu, S Yang - Information Sciences, 2023 - Elsevier
The synthetic minority oversampling technique (SMOTE) is the most general and popular
solution for imbalanced data. Although SMOTE is effective in solving the class imbalance …

Entropy‐based hybrid sampling (EHS) method to handle class overlap in highly imbalanced dataset

A Kumar, D Singh, RS Yadav - Expert Systems, 2024 - Wiley Online Library
Class imbalance and class overlap create difficulties in the training phase of the standard
machine learning algorithm. Its performance is not well in minority classes, especially when …