Using the self organizing map for clustering of text documents

D Isa, VP Kallimani, LH Lee - Expert Systems with Applications, 2009 - Elsevier
An increasing number of computational and statistical approaches have been used for text
classification, including nearest-neighbor classification, naïve Bayes classification, support …

Text Document Pre-Processing Using the Bayes Formula for Classification Based on the Vector Space Model

D Isa, LH Lee, VP Kallimani, R Rajkumar - Comput. Inf. Sci., 2008 - core.ac.uk
This work utilizes the Bayes formula to vectorize a document according to a probability
distribution based on keywords reflecting the probable categories that the document may …

Is Naive Bayes a good classifier for document classification

SL Ting, WH Ip, AHC Tsang - International Journal of Software …, 2011 - researchgate.net
Document classification is a growing interest in the research of text mining. Correctly
identifying the documents into particular category is still presenting challenge because of …

Data mining for text categorization with semi‐supervised agglomerative hierarchical clustering

AG Skarmeta, A Bensaid, N Tazi - International Journal of …, 2000 - Wiley Online Library
In this paper we study the use of a semi‐supervised agglomerative hierarchical clustering
(ssAHC) algorithm to text categorization, which consists of assigning text documents to …

Classification of text documents

YH Li, AK Jain - The Computer Journal, 1998 - academic.oup.com
The exponential growth of the internet has led to a great deal of interest in developing useful
and efficient tools and software to assist users in searching the Web. Document retrieval …

Classifying web documents in a hierarchy of categories: a comprehensive study

M Ceci, D Malerba - Journal of Intelligent Information Systems, 2007 - Springer
Most of the research on text categorization has focused on classifying text documents into a
set of categories with no structural relationships among them (flat classification). However, in …

Text document preprocessing with the Bayes formula for classification using the support vector machine

D Isa, LH Lee, VP Kallimani… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
This work implements an enhanced hybrid classification method through the utilization of the
naïve Bayes classifier and the Support Vector Machine (SVM). In this project, the Bayes …

A comparative approach of dimensionality reduction techniques in text classification

SR Basha, JK Rani - Engineering, Technology & Applied …, 2019 - pdfs.semanticscholar.org
This work deals with document classification. It is a supervised learning method (it needs a
labeled document set for training and a test set of documents to be classified). The …

Using unsupervised clustering approach to train the Support Vector Machine for text classification

N Shafiabady, LH Lee, R Rajkumar, VP Kallimani… - Neurocomputing, 2016 - Elsevier
The use of learning algorithms for text classification assumes the availability of a large
amount of documents which have been organized and labeled correctly by human experts …

Supervised and unsupervised machine learning techniques for text document categorization

A Ozgur - Unpublished Master's Thesis, İstanbul: Boğaziçi …, 2004 - Citeseer
Automatic organization of documents has become an important research issue since the
explosion of digital and online text information. There are mainly two machine learning …