Improved document categorization through feature-rich combinations

A El Kah, I Zeroual - The International Conference on Artificial Intelligence …, 2021 - Springer
Several comparative studies report new findings relevant to the Text Categorization (TC)
task, and all provide valuable observations. However, many of them addressed Western …

Using typical testors for feature selection in text categorization

A Pons-Porrata, R Gil-García… - Progress in Pattern …, 2007 - Springer
A major difficulty of text categorization problems is the high dimensionality of the feature
space. Thus, feature selection is often performed in order to increase both the efficiency and …

Scoring and selecting terms for text categorization

E Montanes, I Diaz, J Ranilla… - IEEE Intelligent …, 2005 - ieeexplore.ieee.org
We propose a set of machine learning (ML)-based scoring measures for conducting feature
selection. We've tested these measures on documents from two well-known corpora …

Text categorization using association rule and naive Bayes classifier

SM Kamruzzaman, CM Rahman - arXiv preprint arXiv:1009.4994, 2010 - arxiv.org
As the amount of online text increases, the demand for text categorization to aid the analysis
and management of text is increasing. Text is cheap, but information, in the form of knowing …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

A comparative study on using principle component analysis with different text classifiers

AI Taloba, DA Eisa, SSI Ismail - arXiv preprint arXiv:1807.03283, 2018 - arxiv.org
Text categorization (TC) is the task of automatically organizing a set of documents into a set
of pre-defined categories. Over the last few years, increased attention has been paid to the …

Feature selection strategies for text categorization

P Soucy, GW Mineau - Advances in Artificial Intelligence: 16th Conference …, 2003 - Springer
Feature selection is an important research issue in text categorization. The reason for this is
that thousands of features are often involved, even when the simplest document …

Cascaded feature selection in SVMs text categorization

T Masuyama, H Nakagawa - International Conference on Intelligent Text …, 2003 - Springer
This paper investigates the effect of a cascaded feature selection (CFS) in SVMs text
categorization. Unlike existing feature selections, our method (CFS) has two advantages …

A novel feature selection technique for text classification using Naive Bayes

S Dey Sarkar, S Goswami, A Agarwal… - International scholarly …, 2014 - Wiley Online Library
With the proliferation of unstructured data, text classification or text categorization has found
many applications in topic classification, sentiment analysis, authorship identification, spam …

A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm

H Uğuz - Knowledge-Based Systems, 2011 - Elsevier
Text categorization is widely used when organizing documents in a digital form. Due to the
increasing number of documents in digital form, automated text categorization has become …