Big data preprocessing: methods and prospects S García, S Ramírez-Gallego, J Luengo, JM Benítez, F Herrera Big data analytics 1, 1-22, 2016 | 671 | 2016 |
A survey on data preprocessing for data stream mining: Current status and future directions S Ramírez-Gallego, B Krawczyk, S García, M Woźniak, F Herrera Neurocomputing 239, 39-57, 2017 | 513 | 2017 |
kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data J Maillo, S Ramírez, I Triguero, F Herrera Knowledge-Based Systems 117, 3-15, 2017 | 376 | 2017 |
Web usage mining to improve the design of an e-commerce website: OrOliveSur. com CJ Carmona, S Ramírez-Gallego, F Torres, E Bernal, MJ del Jesus, ... Expert Systems with Applications 39 (12), 11243-11249, 2012 | 206 | 2012 |
Data discretization: taxonomy and big data challenge S Ramírez‐Gallego, S García, H Mouriño‐Talín, D Martínez‐Rego, ... Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 6 (1), 5-21, 2016 | 188 | 2016 |
Big Data: Tutorial and guidelines on information and process fusion for analytics algorithms with MapReduce S Ramírez-Gallego, A Fernández, S García, M Chen, F Herrera Information Fusion 42, 51-61, 2018 | 187 | 2018 |
Fast‐mRMR: Fast minimum redundancy maximum relevance algorithm for high‐dimensional big data S Ramírez‐Gallego, I Lastra, D Martínez‐Rego, V Bolón‐Canedo, ... International Journal of Intelligent Systems 32 (2), 134-152, 2017 | 177 | 2017 |
Evolutionary feature selection for big data classification: A mapreduce approach D Peralta, S Del Río, S Ramírez-Gallego, I Triguero, JM Benitez, ... Mathematical Problems in Engineering 2015 (1), 246139, 2015 | 177 | 2015 |
A comparison on scalability for batch big data processing on Apache Spark and Apache Flink D García-Gil, S Ramírez-Gallego, S García, F Herrera Big Data Analytics 2, 1-11, 2017 | 127 | 2017 |
An information theory-based feature selection framework for big data under apache spark S Ramírez-Gallego, H Mouriño-Talín, D Martínez-Rego, V Bolón-Canedo, ... IEEE Transactions on Systems, Man, and Cybernetics: Systems 48 (9), 1441-1453, 2017 | 115 | 2017 |
Big data preprocessing J Luengo, D García-Gil, S Ramírez-Gallego, S García, F Herrera Cham: Springer, 2020 | 111 | 2020 |
Nearest neighbor classification for high-speed big data streams using spark S Ramírez-Gallego, B Krawczyk, S García, M Woźniak, JM Benítez, ... IEEE Transactions on Systems, Man, and Cybernetics: Systems 47 (10), 2727-2739, 2017 | 82 | 2017 |
Principal components analysis random discretization ensemble for big data D García-Gil, S Ramírez-Gallego, S García, F Herrera Knowledge-Based Systems 150, 166-174, 2018 | 51 | 2018 |
Multivariate Discretization Based on Evolutionary Cut Points Selection for Classification S Ramirez-Gallego, S Garcia, JM Benitez, F Herrera IEEE Transactions on Cybernetics, 2015 | 51 | 2015 |
Big Data: Preprocesamiento y calidad de datos F Herrera novática 237 (1), 17-20, 2016 | 49 | 2016 |
A forecasting methodology for workload forecasting in cloud systems FJ Baldan, S Ramirez-Gallego, C Bergmeir, F Herrera, JM Benitez IEEE Transactions on Cloud Computing 6 (4), 929-941, 2016 | 48 | 2016 |
A distributed evolutionary multivariate discretizer for big data processing on apache spark S Ramírez-Gallego, S García, JM Benítez, F Herrera Swarm and Evolutionary Computation 38, 240-250, 2018 | 40 | 2018 |
Distributed entropy minimization discretizer for big data analysis under apache spark S Ramirez-Gallego, S Garcia, H Mourino-Talin, D Martinez-Rego, ... 2015 IEEE Trustcom/BigDataSE/ISPA 2, 33-40, 2015 | 32 | 2015 |
Online entropy-based discretization for data streaming classification S Ramírez-Gallego, S García, F Herrera Future Generation Computer Systems 86, 59-70, 2018 | 25 | 2018 |
BELIEF: A distance-based redundancy-proof feature selection method for Big Data D López, S Ramírez-Gallego, S García, N Xiong, F Herrera Information Sciences 558, 124-139, 2021 | 12 | 2021 |