查看文章

mdpi.com 中的 [HTML]

A tri-stage wrapper-filter feature selection framework for disease classification

作者

Moumita Mandal, Pawan Kumar Singh, Muhammad Fazal Ijaz, Jana Shafi, Ram Sarkar

发表日期

2021/8/18

期刊

Sensors

卷号

期号

页码范围

5571

出版商

MDPI

简介

In machine learning and data science, feature selection is considered as a crucial step of data preprocessing. When we directly apply the raw data for classification or clustering purposes, sometimes we observe that the learning algorithms do not perform well. One possible reason for this is the presence of redundant, noisy, and non-informative features or attributes in the datasets. Hence, feature selection methods are used to identify the subset of relevant features that can maximize the model performance. Moreover, due to reduction in feature dimension, both training time and storage required by the model can be reduced as well. In this paper, we present a tri-stage wrapper-filter-based feature selection framework for the purpose of medical report-based disease detection. In the first stage, an ensemble was formed by four filter methods—Mutual Information, ReliefF, Chi Square, and Xvariance—and then each feature from the union set was assessed by three classification algorithms—support vector machine, naïve Bayes, and k-nearest neighbors—and an average accuracy was calculated. The features with higher accuracy were selected to obtain a preliminary subset of optimal features. In the second stage, Pearson correlation was used to discard highly correlated features. In these two stages, XGBoost classification algorithm was applied to obtain the most contributing features that, in turn, provide the best optimal subset. Then, in the final stage, we fed the obtained feature subset to a meta-heuristic algorithm, called whale optimization algorithm, in order to further reduce the feature set and to achieve higher accuracy. We evaluated the …

引用总数

被引用次数：85

20212022202320245 30 38 11

学术搜索中的文章

A tri-stage wrapper-filter feature selection framework for disease classification

M Mandal, PK Singh, MF Ijaz, J Shafi, R Sarkar - Sensors, 2021

被引用次数：85 相关文章所有 10 个版本