A review on classification of imbalanced data for wireless sensor networks

H Patel, D Singh Rajput… - International …, 2020 - journals.sagepub.com
Classification of imbalanced data is a vastly explored issue of the last and present decade
and still keeps the same importance because data are an essential term today and it …

A review of class imbalance problem

SM Abd Elrahman, A Abraham - Journal of Network and Innovative …, 2013 - cspub-jnic.org
Class imbalance is one of the challenges of machine learning and data mining fields.
Imbalance data sets degrades the performance of data mining and machine learning …

The importance of being external. methodological insights for the external validation of machine learning models in medicine

F Cabitza, A Campagner, F Soares… - Computer Methods and …, 2021 - Elsevier
Abstract Background and Objective Medical machine learning (ML) models tend to perform
better on data from the same cohort than on new data, often due to overfitting, or co-variate …

Features dimensionality reduction approaches for machine learning based network intrusion detection

R Abdulhammed, H Musafer, A Alessa, M Faezipour… - Electronics, 2019 - mdpi.com
The security of networked systems has become a critical universal issue that influences
individuals, enterprises and governments. The rate of attacks against networked systems …

An insider data leakage detection using one-hot encoding, synthetic minority oversampling and machine learning techniques

T Al-Shehari, RA Alsowail - Entropy, 2021 - mdpi.com
Insider threats are malicious acts that can be carried out by an authorized employee within
an organization. Insider threats represent a major cybersecurity challenge for private and …

GHOST: adjusting the decision threshold to handle imbalanced data in machine learning

C Esposito, GA Landrum, N Schneider… - Journal of Chemical …, 2021 - ACS Publications
Machine learning classifiers trained on class imbalanced data are prone to overpredict the
majority class. This leads to a larger misclassification rate for the minority class, which in …

A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data

Z Xu, D Shen, T Nie, Y Kou, N Yin, X Han - Information Sciences, 2021 - Elsevier
The algorithm of C4. 5 decision tree has the advantages of high classification accuracy, fast
calculation speed and comprehensible classification rules, so it is widely used for medical …

An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics

V López, A Fernández, S García, V Palade… - Information sciences, 2013 - Elsevier
Training classifiers with datasets which suffer of imbalanced class distributions is an
important problem in data mining. This issue occurs when the number of examples …

Mahakil: Diversity based oversampling approach to alleviate the class imbalance issue in software defect prediction

KE Bennin, J Keung, P Phannachitta… - IEEE Transactions …, 2017 - ieeexplore.ieee.org
Highly imbalanced data typically make accurate predictions difficult. Unfortunately, software
defect datasets tend to have fewer defective modules than non-defective modules. Synthetic …

Fault detection in gears using fault samples enlarged by a combination of numerical simulation and a generative adversarial network

Y Gao, X Liu, J Xiang - IEEE/ASME Transactions on …, 2021 - ieeexplore.ieee.org
It is inevitable for gear to become damaged, which has a profound effect on the performance
of gear transmission systems. Solving the problem of gear fault detection using artificial …