A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction

R Zebari, A Abdulazeez, D Zeebaree, D Zebari… - Journal of Applied …, 2020 - jastt.org
Journal of Applied Science and Technology Trends, 2020jastt.org
Due to sharp increases in data dimensions, working on every data mining or machine
learning (ML) task requires more efficient techniques to get the desired results. Therefore, in
recent years, researchers have proposed and developed many methods and techniques to
reduce the high dimensions of data and to attain the required accuracy. To ameliorate the
accuracy of learning features as well as to decrease the training time dimensionality
reduction is used as a pre-processing step, which can eliminate irrelevant data, noise, and …
Abstract
Due to sharp increases in data dimensions, working on every data mining or machine learning (ML) task requires more efficient techniques to get the desired results. Therefore, in recent years, researchers have proposed and developed many methods and techniques to reduce the high dimensions of data and to attain the required accuracy. To ameliorate the accuracy of learning features as well as to decrease the training time dimensionality reduction is used as a pre-processing step, which can eliminate irrelevant data, noise, and redundant features. Dimensionality reduction (DR) has been performed based on two main methods, which are feature selection (FS) and feature extraction (FE). FS is considered an important method because data is generated continuously at an ever-increasing rate; some serious dimensionality problems can be reduced with this method, such as decreasing redundancy effectively, eliminating irrelevant data, and ameliorating result comprehensibility. Moreover, FE transacts with the problem of finding the most distinctive, informative, and decreased set of features to ameliorate the efficiency of both the processing and storage of data. This paper offers a comprehensive approach to FS and FE in the scope of DR. Moreover, the details of each paper, such as used algorithms/approaches, datasets, classifiers, and achieved results are comprehensively analyzed and summarized. Besides, a systematic discussion of all of the reviewed methods to highlight authors' trends, determining the method (s) has been done, which significantly reduced computational time, and selecting the most accurate classifiers. As a result, the different types of both methods have been discussed and analyzed the findings.
jastt.org
以上显示的是最相近的搜索结果。 查看全部搜索结果