Class prediction for high-dimensional class-imbalanced data

R Blagus, L Lusa - BMC bioinformatics, 2010 - Springer
Background The goal of class prediction studies is to develop rules to accurately predict the
class membership of new samples. The rules are derived using the values of the variables …

Class-imbalanced classifiers for high-dimensional data

WJ Lin, JJ Chen - Briefings in bioinformatics, 2013 - academic.oup.com
A class-imbalanced classifier is a decision rule to predict the class membership of new
samples from an available data set where the class sizes differ considerably. When the class …

Hellinger distance-based stable sparse feature selection for high-dimensional class-imbalanced data

GH Fu, YJ Wu, MJ Zong, J Pan - BMC bioinformatics, 2020 - Springer
Background Feature selection in class-imbalance learning has gained increasing attention
in recent years due to the massive growth of high-dimensional class-imbalanced data …

SMOTE for high-dimensional class-imbalanced data

R Blagus, L Lusa - BMC bioinformatics, 2013 - Springer
Background Classification using class-imbalanced data is biased in favor of the majority
class. The bias is even larger for high-dimensional data, where the number of variables …

Learning from high-dimensional biomedical datasets: the issue of class imbalance

B Pes - IEEE Access, 2020 - ieeexplore.ieee.org
As witnessed by a vast corpus of literature, dimensionality reduction is a fundamental step
for biomedical data analysis. Indeed, in this domain, there is often the need for coping with a …

Feature selection for high-dimensional class-imbalanced data sets using support vector machines

S Maldonado, R Weber, F Famili - Information sciences, 2014 - Elsevier
Feature selection and classification of imbalanced data sets are two of the most interesting
machine learning challenges, attracting a growing attention from both, industry and …

Coupling different methods for overcoming the class imbalance problem

L Nanni, C Fantozzi, N Lazzarini - Neurocomputing, 2015 - Elsevier
Many classification problems must deal with imbalanced datasets where one class–the
majority class–outnumbers the other classes. Standard classification methods do not …

On the joint-effect of class imbalance and overlap: a critical review

MS Santos, PH Abreu, N Japkowicz… - Artificial Intelligence …, 2022 - Springer
Current research on imbalanced data recognises that class imbalance is aggravated by
other data intrinsic characteristics, among which class overlap stands out as one of the most …

The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets

T Saito, M Rehmsmeier - PloS one, 2015 - journals.plos.org
Binary classifiers are routinely evaluated with performance measures such as sensitivity and
specificity, and performance is frequently illustrated with Receiver Operating Characteristics …

Optimally splitting cases for training and testing high dimensional classifiers

KK Dobbin, RM Simon - BMC medical genomics, 2011 - Springer
Background We consider the problem of designing a study to develop a predictive classifier
from high dimensional data. A common study design is to split the sample into a training set …