Survey on deep learning with class imbalance

JM Johnson, TM Khoshgoftaar - Journal of big data, 2019 - Springer
The purpose of this study is to examine existing deep learning techniques for addressing
class imbalanced data. Effective classification with imbalanced data is an important area of …

A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

Types of minority class examples and their influence on learning classifiers from imbalanced data

K Napierala, J Stefanowski - Journal of Intelligent Information Systems, 2016 - Springer
Many real-world applications reveal difficulties in learning classifiers from imbalanced data.
Although several methods for improving classifiers have been introduced, the identification …

SMOTE-RSB *: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets …

E Ramentol, Y Caballero, R Bello, F Herrera - Knowledge and information …, 2012 - Springer
Imbalanced data is a common problem in classification. This phenomenon is growing in
importance since it appears in most real domains. It has special relevance to highly …

Mining with rarity: a unifying framework

GM Weiss - ACM Sigkdd Explorations Newsletter, 2004 - dl.acm.org
Rare objects are often of great interest and great value. Until recently, however, rarity has
not received much attention in the context of data mining. Now, as increasingly complex real …

Learning when training data are costly: The effect of class distribution on tree induction

GM Weiss, F Provost - Journal of artificial intelligence research, 2003 - jair.org
For large, real-world inductive learning problems, the number of training examples often
must be limited due to the costs associated with procuring, preparing, and storing the …

Handling data irregularities in classification: Foundations, trends, and future challenges

S Das, S Datta, BB Chaudhuri - Pattern Recognition, 2018 - Elsevier
Most of the traditional pattern classifiers assume their input data to be well-behaved in terms
of similar underlying class distributions, balanced size of classes, the presence of a full set of …

Class imbalances versus small disjuncts

T Jo, N Japkowicz - ACM Sigkdd Explorations Newsletter, 2004 - dl.acm.org
It is often assumed that class imbalances are responsible for significant losses of
performance in standard classifiers. The purpose of this paper is to the question whether …

[PDF][PDF] The effect of class distribution on classifier learning: an empirical study

GM Weiss, F Provost - 2001 - storm.cis.fordham.edu
In this article we analyze the effect of class distribution on classifier learning. We begin by
describing the different ways in which class distribution affects learning and how it affects the …

[PDF][PDF] Cost-sensitive learning vs. sampling: Which is best for handling unbalanced classes with unequal error costs?

GM Weiss, K McCarthy, B Zabar - Dmin, 2007 - storm.cis.fordham.edu
The classifier built from a data set with a highly skewed class distribution generally predicts
the more frequently occurring classes much more often than the infrequently occurring …