作者
David López, Sergio Ramírez-Gallego, Salvador García, Ning Xiong, Francisco Herrera
发表日期
2021/5/1
期刊
Information Sciences
卷号
558
页码范围
124-139
出版商
Elsevier
简介
With the advent of Big Data era, data reduction methods are in highly demand given their ability to simplify huge data, and ease complex learning processes. Concretely, algorithms able to select relevant dimensions from a set of millions are of huge importance. Although effective, these techniques also suffer from the “scalability” curse when they are brought into tackle large-scale problems.
In this paper, we propose a distributed feature weighting algorithm which precisely estimates feature importance in large datasets using the well-know algorithm RELIEF in small problems. Our solution, called BELIEF, incorporates a novel redundancy elimination measure that generates similar schemes to those based on entropy, but at a much lower time cost. Furthermore, BELIEF provides a smooth scale-up when more instances are required to increase precision in estimations.
Empirical tests performed on our method illustrate …
引用总数
20212022202320243324
学术搜索中的文章