A feature selection method for improved document classification

T Basu, CA Murthy - Advanced Data Mining and Applications: 8th …, 2012 - Springer
The aim of text document classification is to automatically group a document to a predefined
class. The main problem of document classification is high dimensionality and sparsity of the …

Categorical document frequency based feature selection for text categorization

Z Zhen, H Wang, L Han, Z Shi - 2011 International Conference …, 2011 - ieeexplore.ieee.org
Effective feature selection methods are essential for improving the accuracy and efficiency of
text categorization. Motivated by document frequency, we proposed a new filter-based …

Research on the feature selection techniques used in text classification

Y Li, C Chen - 2012 9th International Conference on Fuzzy …, 2012 - ieeexplore.ieee.org
With the ever-increasing number of digital documents, the ability to automatically classify
those documents both quickly and accurately is becoming more critical and difficult. A text …

Feature selection for text classification using machine learning approaches

K Thirumoorthy, K Muneeswaran - National Academy Science Letters, 2022 - Springer
In the present scenario, millions of internet users are contributing a huge amount of data in
the form of unstructured text documents. In text classification, the high dimensional feature …

A novel feature selection method based on category information analysis for class prejudging in text classification

Q Wang, Y Guan, X Wang, Z Xu - International Journal of …, 2006 - researchgate.net
This paper presents a new feature selection algorithm with the category information analysis
in text classification. The algorithm obscures or reduces the noise of text features by …

Class dependent feature scaling method using naive Bayes classifier for text datamining

E Youn, MK Jeong - Pattern Recognition Letters, 2009 - Elsevier
The problem of feature selection is to find a subset of features for optimal classification. A
critical part of feature selection is to rank features according to their importance for …

A study on mutual information-based feature selection for text categorization

Y Xu, GJF Jones, JT Li, B Wang, CM Sun - Journal of Computational …, 2007 - doras.dcu.ie
Feature selection plays an important role in text categorization. Automatic feature selection
methods such as document frequency thresholding (DF), information gain (IG), mutual …

New feature selection methods based on context similarity for text categorization

Y Chen, B Han, P Hou - 2014 11th International Conference on …, 2014 - ieeexplore.ieee.org
High dimensionality of the feature space is one of the most important concerns in text
categorization problems, and feature selection is widely used for reducing the …

A new feature selection method for text classification

G Uchyigit, K Clark - … Journal of Pattern Recognition and Artificial …, 2007 - World Scientific
Text classification is the problem of classifying a set of documents into a pre-defined set of
classes. A major problem with text classification problems is the high dimensionality of the …

Effective text classification by a supervised feature selection approach

T Basu, CA Murthy - 2012 IEEE 12th International Conference on …, 2012 - ieeexplore.ieee.org
The high dimensionality of data is a great challenge for effective text classification. Each
document in a document corpus contains much irrelevant and noisy information which …