作者
Daniel Peralta, Sara Del Río, Sergio Ramírez-Gallego, Isaac Triguero, Jose M Benitez, Francisco Herrera
发表日期
2015
期刊
Mathematical Problems in Engineering
卷号
2015
期号
1
页码范围
246139
出版商
Hindawi Publishing Corporation
简介
Nowadays, many disciplines have to deal with big datasets that additionally involve a high number of features. Feature selection methods aim at eliminating noisy, redundant, or irrelevant features that may deteriorate the classification performance. However, traditional methods lack enough scalability to cope with datasets of millions of instances and extract successful results in a delimited time. This paper presents a feature selection algorithm based on evolutionary computation that uses the MapReduce paradigm to obtain subsets of features from big datasets. The algorithm decomposes the original dataset in blocks of instances to learn from them in the map phase; then, the reduce phase merges the obtained partial results into a final vector of feature weights, which allows a flexible application of the feature selection procedure using a threshold to determine the selected subset of features. The feature selection …
引用总数
2015201620172018201920202021202220232024213182431302319115
学术搜索中的文章
D Peralta, S Del Río, S Ramírez-Gallego, I Triguero… - Mathematical Problems in Engineering, 2015