Feature selection for text classification: A review

X Deng, Y Li, J Weng, J Zhang - Multimedia Tools and Applications, 2019 - Springer
Big multimedia data is heterogeneous in essence, that is, the data may be a mixture of
video, audio, text, and images. This is due to the prevalence of novel applications in recent …

Toward optimal feature selection in naive Bayes for text categorization

B Tang, S Kay, H He - IEEE transactions on knowledge and …, 2016 - ieeexplore.ieee.org
Automated feature selection is important for text categorization to reduce feature size and to
speed up learning process of classifiers. In this paper, we present a novel and efficient …

On strategies for imbalanced text classification using SVM: A comparative study

A Sun, EP Lim, Y Liu - Decision Support Systems, 2009 - Elsevier
Many real-world text classification tasks involve imbalanced training examples. The
strategies proposed to address the imbalanced classification (eg, resampling, instance …

A fuzzy self-constructing feature clustering algorithm for text classification

JY Jiang, RJ Liou, SJ Lee - IEEE transactions on knowledge …, 2010 - ieeexplore.ieee.org
Feature clustering is a powerful method to reduce the dimensionality of feature vectors for
text classification. In this paper, we propose a fuzzy similarity-based self-constructing …

An ensemble feature selection algorithm based on PageRank centrality and fuzzy logic

M Joodaki, MB Dowlatshahi, NZ Joodaki - Knowledge-Based Systems, 2021 - Elsevier
One of the crucial processes in machine learning algorithms to improve the performance as
well as, in some cases, to reduce the computational cost is feature selection. In other words …

Online feature selection with group structure analysis

J Wang, M Wang, P Li, L Liu, Z Zhao… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Online selection of dynamic features has attracted intensive interest in recent years.
However, existing online feature selection methods evaluate features individually and …

Supervised Hebb rule based feature selection for text classification

H Wang, M Hong - Information Processing & Management, 2019 - Elsevier
Text documents usually contain high dimensional non-discriminative (irrelevant and noisy)
terms which lead to steep computational costs and poor learning performance of text …

Feature selection with redundancy-complementariness dispersion

Z Chen, C Wu, Y Zhang, Z Huang, B Ran… - Knowledge-Based …, 2015 - Elsevier
Feature selection has attracted significant attention in data mining and machine learning in
the past decades. Many existing feature selection methods eliminate redundancy by …

Feature selection with attributes clustering by maximal information coefficient

X Zhao, W Deng, Y Shi - Procedia Computer Science, 2013 - Elsevier
Feature selection is usually a separate procedure which can not benefit from result of the
data exploration. In this paper, we propose a unsupervised feature selection method which …

Sample cutting method for imbalanced text sentiment classification based on BRC

S Wang, D Li, L Zhao, J Zhang - Knowledge-Based Systems, 2013 - Elsevier
The vast subjective texts spreading all over the Internet promoted the demand for text
sentiment classification technology. A well-known fact that often weakens the performance of …