Best terms: an efficient feature-selection algorithm for text categorization

D Fragoudis, D Meretakis, S Likothanassis - Knowledge and Information …, 2005 - Springer
In this paper, we propose a new feature-selection algorithm for text classification, called best
terms (BT). The complexity of BT is linear in respect to the number of the training-set …

[PDF][PDF] N-gram based Text Categorization

P Náther - Comenius University, Bratislava, Slovakia, 2005 - Citeseer
We live in the world where information have a great value and the amount of available
information (mostly on internet) has been expansively growing during last years. There are …

[PDF][PDF] Improving text categorization by using a topic model

W Sriurai - Advanced Computing, 2011 - Citeseer
Most text categorization algorithms represent a document collection as a Bag of Words
(BOW). The BOW representation is unable to recognize synonyms from a given term set and …

[PDF][PDF] Text classification by labeling words

B Liu, X Li, WS Lee, PS Yu - Aaai, 2004 - cdn.aaai.org
Traditionally, text classifiers are built from labeled training examples. Labeling is usually
done manually by human experts (or the users), which is a labor intensive and time …

[PDF][PDF] Text classification: a recent overview

M Ikonomakis, S Kotsiantis… - Proceedings of the 9th …, 2005 - researchgate.net
Text classification is becoming more and more important with the rapid growth of on-line
information available. This paper describes the text classification process. Of course, a …

[PDF][PDF] Text classification using machine learning techniques.

M Ikonomakis, S Kotsiantis, V Tampakas - WSEAS transactions on …, 2005 - Citeseer
Automated text classification has been considered as a vital method to manage and process
a vast amount of documents in digital forms that are widespread and continuously …

A comparison of word-and sense-based text categorization using several classification algorithms

A Kehagias, V Petridis, VG Kaburlasos… - Journal of Intelligent …, 2003 - Springer
Most of the text categorization algorithms in the literature represent documents as collections
of words. An alternative which has not been sufficiently explored is the use of word …

[引用][C] Feature extraction algorithms for classification of text documents

A Rahman, HA Babri, M Saeed - International Conference on Computer and …, 2012

[PDF][PDF] Representation quality in text classification: An introduction and experiment

DD Lewis - Speech and Natural Language: Proceedings of a …, 1990 - aclanthology.org
The way in which text is represented has a strong impact on the performance of text
classification (retrieval and categorization) systems. We discuss the operation of text …

Discriminative features for text document classification

K Torkkola - Formal Pattern Analysis & Applications, 2004 - Springer
The bag-of-words approach to text document representation typically results in vectors of the
order of 5000–20,000 components as the representation of documents. To make effective …