MRPR: A MapReduce solution for prototype reduction in big data classification

I Triguero, D Peralta, J Bacardit, S García, F Herrera - neurocomputing, 2015 - Elsevier
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a
very interesting and challenging task. The application of standard data mining tools in such …

A taxonomy and experimental study on prototype generation for nearest neighbor classification

I Triguero, J Derrac, S Garcia… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
The nearest neighbor (NN) rule is one of the most successfully used techniques to resolve
classification and pattern recognition tasks. Despite its high classification accuracy, this rule …

Evolutionary feature selection for big data classification: A mapreduce approach

D Peralta, S Del Río, S Ramírez-Gallego… - Mathematical …, 2015 - Wiley Online Library
Nowadays, many disciplines have to deal with big datasets that additionally involve a high
number of features. Feature selection methods aim at eliminating noisy, redundant, or …

Selecting critical patterns based on local geometrical and statistical information

Y Li, L Maguire - IEEE transactions on pattern analysis and …, 2010 - ieeexplore.ieee.org
Pattern selection methods have been traditionally developed with a dependency on a
specific classifier. In contrast, this paper presents a method that selects critical patterns …

A Novel Template Reduction Approach for the -Nearest Neighbor Method

HA Fayed, AF Atiya - IEEE Transactions on Neural Networks, 2009 - ieeexplore.ieee.org
The K-nearest neighbor (KNN) rule is one of the most widely used pattern classification
algorithms. For large data sets, the computational demands for classifying patterns using …

Differential evolution for optimizing the positioning of prototypes in nearest neighbor classification

I Triguero, S García, F Herrera - Pattern recognition, 2011 - Elsevier
Nearest neighbor classification is one of the most used and well known methods in data
mining. Its simplest version has several drawbacks, such as low efficiency, high storage …

Improved pseudo nearest neighbor classification

J Gou, Y Zhan, Y Rao, X Shen, X Wang… - Knowledge-Based Systems, 2014 - Elsevier
Abstract k-Nearest neighbor (KNN) rule is a very simple and powerful classification
algorithm. In this article, we propose a new KNN-based classifier, called the local mean …

Feature selection, online feature selection techniques for big data classification:-a review

SG Devi, M Sabrigiriraj - 2018 International Conference on …, 2018 - ieeexplore.ieee.org
In the recent times, several disciplines have to tackle with huge datasets, which are involved
with a huge number of additional features. Feature Selection (FS) techniques target at …

[HTML][HTML] Quad division prototype selection-based k-nearest neighbor classifier for click fraud detection from highly skewed user click dataset

D Sisodia, DS Sisodia - … Science and Technology, an International Journal, 2022 - Elsevier
In online advertising, the user-clicks dataset based fraudulent publishers' classification
models exhibit poor performance due to high skewness in class distribution of the …

[HTML][HTML] Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classification

JJ Valero-Mas, AJ Gallego, P Alonso-Jiménez… - Pattern Recognition, 2023 - Elsevier
Prototype Generation (PG) methods are typically considered for improving the efficiency of
the k-Nearest Neighbour (k NN) classifier when tackling high-size corpora. Such …