作者
Diego García-Gil, Sergio Ramírez-Gallego, Salvador García, Francisco Herrera
发表日期
2018/6/15
期刊
Knowledge-Based Systems
卷号
150
页码范围
166-174
出版商
Elsevier
简介
Humongous amounts of data have created a lot of challenges in terms of data computation and analysis. Classic data mining techniques are not prepared for the new space and time requirements. Discretization and dimensionality reduction are two of the data reduction tasks in knowledge discovery. Random Projection Random Discretization is a novel and recently proposed ensemble method by Ahmad and Brown in 2014 that performs discretization and dimensionality reduction to create more informative data. Despite the good efficiency of random projections in dimensionality reduction, more robust methods like Principal Components Analysis (PCA) can improve the performance.
We propose a new ensemble method to overcome this drawback using the Apache Spark platform and PCA for dimension reduction, named Principal Components Analysis Random Discretization Ensemble. Experimental results on …
引用总数
2018201920202021202220232024312159345
学术搜索中的文章
D García-Gil, S Ramírez-Gallego, S García, F Herrera - Knowledge-Based Systems, 2018