A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

[PDF][PDF] Handling imbalanced datasets: A review

S Kotsiantis, D Kanellopoulos… - … transactions on computer …, 2006 - academia.edu
Learning classifiers from imbalanced or skewed datasets is an important topic, arising very
often in practice in classification problems. In such problems, almost all the instances are …

Cost-sensitive learning of deep feature representations from imbalanced data

SH Khan, M Hayat, M Bennamoun… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Class imbalance is a common problem in the case of real-world object detection and
classification tasks. Data of some classes are abundant, making them an overrepresented …

Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning

H Han, WY Wang, BH Mao - International conference on intelligent …, 2005 - Springer
In recent years, mining with imbalanced data sets receives more and more attentions in both
theoretical and practical aspects. This paper introduces the importance of imbalanced data …

Exploratory undersampling for class-imbalance learning

XY Liu, J Wu, ZH Zhou - … Systems, Man, and Cybernetics, Part B …, 2008 - ieeexplore.ieee.org
Undersampling is a popular method in dealing with class-imbalance problems, which uses
only a subset of the majority class and thus is very efficient. The main deficiency is that many …

On the class imbalance problem

X Guo, Y Yin, C Dong, G Yang… - 2008 Fourth international …, 2008 - ieeexplore.ieee.org
The class imbalance problem has been recognized in many practical domains and a hot
topic of machine learning in recent years. In such a problem, almost all the examples are …

Combating the small sample class imbalance problem using feature selection

M Wasikowski, X Chen - IEEE Transactions on knowledge and …, 2009 - ieeexplore.ieee.org
The class imbalance problem is encountered in real-world applications of machine learning
and results in a classifier's suboptimal performance. Researchers have rigorously studied …

Fast: a roc-based feature selection metric for small samples and imbalanced data classification problems

X Chen, M Wasikowski - Proceedings of the 14th ACM SIGKDD …, 2008 - dl.acm.org
The class imbalance problem is encountered in a large number of practical applications of
machine learning and data mining, for example, information retrieval and filtering, and the …

Preprocessing unbalanced data using support vector machine

MAH Farquad, I Bose - Decision Support Systems, 2012 - Elsevier
This paper deals with the application of support vector machine (SVM) to deal with the class
imbalance problem. The objective of this paper is to examine the feasibility and efficiency of …

A big data MapReduce framework for fault diagnosis in cloud-based manufacturing

A Kumar, R Shankar, A Choudhary… - International Journal of …, 2016 - Taylor & Francis
This research develops a MapReduce framework for automatic pattern recognition based on
fault diagnosis by solving data imbalance problem in a cloud-based manufacturing (CBM) …