A survey on data preprocessing for data stream mining: Current status and future directions

S Ramírez-Gallego, B Krawczyk, S García, M Woźniak… - Neurocomputing, 2017 - Elsevier
Data preprocessing and reduction have become essential techniques in current knowledge
discovery scenarios, dominated by increasingly large datasets. These methods aim at …

Online feature selection system for big data classification based on multi-objective automated negotiation

F BenSaid, AM Alimi - Pattern Recognition, 2021 - Elsevier
Feature Selection (FS) plays an important role in learning and classification tasks. Its
objective is to select the relevant and non-redundant features. Considering the huge number …

Spatio-temporal event forecasting using incremental multi-source feature learning

L Zhao, Y Gao, J Ye, F Chen, Y Ye, CT Lu… - ACM Transactions on …, 2021 - dl.acm.org
The forecasting of significant societal events such as civil unrest and economic crisis is an
interesting and challenging problem which requires both timeliness, precision, and …

Parallel feature selection for distributed-memory clusters

J González-Domínguez, V Bolón-Canedo, B Freire… - Information …, 2019 - Elsevier
Feature selection is nowadays an extremely important data mining stage in the field of
machine learning due to the appearance of problems of high dimensionality. In the literature …

[HTML][HTML] Parallel-FST: A feature selection library for multicore clusters

B Beceiro, J González-Domínguez, J Touriño - Journal of Parallel and …, 2022 - Elsevier
Feature selection is a subfield of machine learning focused on reducing the dimensionality
of datasets by performing a computationally intensive process. This work presents Parallel …

CUDA-JMI: Acceleration of feature selection on heterogeneous systems

J González-Domínguez, RR Expósito… - Future Generation …, 2020 - Elsevier
Feature selection is a crucial step nowadays in machine learning and data analytics to
remove irrelevant and redundant characteristics and thus to provide fast and reliable …

Effective evolutionary multilabel feature selection under a budget constraint

J Lee, W Seo, DW Kim - Complexity, 2018 - Wiley Online Library
Multilabel feature selection involves the selection of relevant features from multilabeled
datasets, resulting in improved multilabel learning accuracy. Evolutionary search‐based …

Learning time-series shapelets via supervised feature selection

A Yamaguchi, K Ueno - Proceedings of the 2021 SIAM International …, 2021 - SIAM
Shapelets are time-series segments effective for classifying time-series instances. Joint
learning of both classifiers and shapelets has been studied in recent years because this …

Streaming feature selection via graph diffusion

W Zheng, S Chen, Z Fu, J Li, J Yang - Information Sciences, 2022 - Elsevier
Streaming feature selection for unlabeled data aims to remove redundant and irrelevant
features from the continuously arriving features without label information. Most existing …

SEM: A softmax-based ensemble model for CTR estimation in real-time bidding advertising

WY Zhu, CH Wang, WY Shih, WC Peng… - … Conference on Big …, 2017 - ieeexplore.ieee.org
In Real-Time Bidding (RTB) advertising, evaluating the Click-Through Rate (CTR) of a bid
request and an ad is important for bidding strategy optimization on Demand-Side Platforms …