A survey on pre-processing techniques: Relevant issues in the context of environmental data mining

K Gibert, M Sànchez–Marrè, J Izquierdo - AI Communications, 2016 - content.iospress.com
One of the important issues related with all types of data analysis, either statistical data
analysis, machine learning, data mining, data science or whatever form of data-driven …

A scalable memetic algorithm for simultaneous instance and feature selection

N García-Pedrajas, A de Haro-García… - Evolutionary …, 2014 - direct.mit.edu
Instance selection is becoming increasingly relevant due to the huge amount of data that is
constantly produced in many fields of research. At the same time, most of the recent pattern …

Training green ai models using elite samples

M Alswaitti, R Verdecchia, G Danoy, P Bouvry… - arXiv preprint arXiv …, 2024 - arxiv.org
The substantial increase in AI model training has considerable environmental implications,
mandating more energy-efficient and sustainable AI practices. On the one hand, data-centric …

From supervised instance and feature selection algorithms to dual selection: a review

F Ros, S Guillaume - … Techniques for Supervised or Unsupervised Tasks, 2020 - Springer
This chapter reviews the data reduction problem for instance and feature selection methods
in the context of supervised classification. In the first part, instance and feature selections are …

A study on the application of instance selection techniques in genetic fuzzy rule-based classification systems: Accuracy-complexity trade-off

M Fazzolari, B Giglio, R Alcalá, F Marcelloni… - Knowledge-Based …, 2013 - Elsevier
In the framework of genetic fuzzy systems, the computational time required by genetic
algorithms for generating fuzzy rule-based models from data increases considerably with the …

On the use of evolutionary feature selection for improving fuzzy rough set based prototype selection

J Derrac, N Verbiest, S García, C Cornelis, F Herrera - Soft Computing, 2013 - Springer
The k-nearest neighbors classifier is a widely used classification method that has proven to
be very effective in supervised learning tasks. In this paper, a fuzzy rough set method for …

[HTML][HTML] PARIS: Partial instance and training set selection. A new scalable approach to multi-label classification

N García-Pedrajas, JM Cuevas-Muñoz… - Information …, 2023 - Elsevier
Multi-label classification has recently attracted research interest as a data mining task. Many
current applications in data mining address problems that have instances belonging to more …

A stochastic approximation approach to fixed instance selection

GFA Yeo, D Akman, I Hudson, J Chan - Information Sciences, 2023 - Elsevier
Instance selection plays a critical role in enhancing the efficacy and efficiency of machine
learning tools when utilised for a data mining task. This study proposes a fixed instance …

Unsupervised instance selection via conjectural hyperrectangles

F Aydin - Neural Computing and Applications, 2023 - Springer
Abstract Machine learning algorithms spend a lot of time processing data because they are
not fast enough to commit huge data sets. Instance selection algorithms especially aim to …

Multi-selection of instances: A straightforward way to improve evolutionary instance selection

N García-Pedrajas, J Pérez-Rodríguez - Applied Soft Computing, 2012 - Elsevier
Although many more complex learning algorithms exist, k-nearest neighbor is still one of the
most successful classifiers in real-world applications. One of the ways of scaling up the k …