Cost-sensitive decision tree ensembles for effective imbalanced classification

B Krawczyk, M Woźniak, G Schaefer - Applied Soft Computing, 2014 - Elsevier
Real-life datasets are often imbalanced, that is, there are significantly more training samples
available for some classes than for others, and consequently the conventional aim of …

[HTML][HTML] Regression conformal prediction with random forests

U Johansson, H Boström, T Löfström, H Linusson - Machine learning, 2014 - Springer
Regression conformal prediction produces prediction intervals that are valid, ie, the
probability of excluding the correct target value is bounded by a predefined confidence level …

Boosted SVM for extracting rules from imbalanced data in application to prediction of the post-operative life expectancy in the lung cancer patients

M Zięba, JM Tomczak, M Lubicz, J Świątek - Applied soft computing, 2014 - Elsevier
In this paper, we present boosted SVM dedicated to solve imbalanced data problems.
Proposed solution combines the benefits of using ensemble classifiers for uneven data …

Adjusted F-measure and kernel scaling for imbalanced data learning

A Maratea, A Petrosino, M Manzo - Information Sciences, 2014 - Elsevier
Rare events are involved in many challenging real world classification problems, where the
minority class is usually the most expensive to sample and to label. As a consequence …

On the importance of the validation technique for classification with imbalanced datasets: Addressing covariate shift when data is skewed

V López, A Fernández, F Herrera - Information Sciences, 2014 - Elsevier
In the field of Data Mining, the estimation of the quality of the learned models is a key step in
order to select the most appropriate tool for the problem to be solved. Traditionally, a k-fold …

Enhancing Multiclass Classification in FARC-HD Fuzzy Classifier: On the Synergy Between -Dimensional Overlap Functions and Decomposition Strategies

M Elkano, M Galar, JA Sanz… - … on Fuzzy Systems, 2014 - ieeexplore.ieee.org
There are many real-world classification problems involving multiple classes, eg, in
bioinformatics, computer vision, or medicine. These problems are generally more difficult …

Overview on evolutionary subgroup discovery: analysis of the suitability and potential of the search performed by evolutionary algorithms

CJ Carmona, P González… - … Reviews: Data Mining …, 2014 - Wiley Online Library
Subgroup discovery (SD) is a descriptive data mining technique using supervised learning.
In this article, we review the use of evolutionary algorithms (EAs) for SD. In particular, we will …

Analyzing the presence of noise in multi-class problems: alleviating its influence with the one-vs-one decomposition

JA Sáez, M Galar, J Luengo, F Herrera - Knowledge and information …, 2014 - Springer
The presence of noise in data is a common problem that produces several negative
consequences in classification problems. In multi-class problems, these consequences are …

Multitask TSK fuzzy system modeling by mining intertask common hidden structure

Y Jiang, FL Chung, H Ishibuchi… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
The classical fuzzy system modeling methods implicitly assume data generated from a
single task, which is essentially not in accordance with many practical scenarios where data …

Fuzzy nearest neighbor algorithms: Taxonomy, experimental analysis and prospects

J Derrac, S García, F Herrera - Information Sciences, 2014 - Elsevier
In recent years, many nearest neighbor algorithms based on fuzzy sets theory have been
developed. These methods form a field, known as fuzzy nearest neighbor classification …