A cluster based approach with n-grams at word level for document classification

A Khabia, MB Chandak - International Journal of Computer Applications, 2015 - Citeseer
ABSTRACT A breakneck progress of computers and web makes it easier to collect and store
large amount of information in the form of text; eg, reviews, forum postings, blogs, web …

Clustering of words based on relative contribution for text categorization

JM Yang, ZY Liu, ZY Qu - IAENG International Journal of Computer …, 2013 - iaeng.org
Term clustering tries to group words based on the similarity criterion between words, so that
the groups can be used as the dimensions of the vector space in the text categorization. We …

Survey on feature selection in document clustering

K Mugunthadevi, S Punitha, M Punithavalli… - International Journal on …, 2011 - Citeseer
Text mining is to research technologies to discover useful knowledge from enormous
collections of documents, and to develop a system to provide knowledge and to support in …

Text categorization of documents using K-means and K-means++ clustering algorithm

AA Shetkar, S Fernandes - Int. J Recent Innov. Trends Comput …, 2016 - core.ac.uk
Text categorization is the technique used for sorting a set of documents into categories from
a predefined set. Text categorization is useful in better management and retrieval of the text …

Document representation techniques and their effect on the document Clustering and Classification: A Review

K Singh, HM Devi, AK Mahanta - International Journal of …, 2017 - researchgate.net
Text data is the most common form of storing information. When engine search an query,
user obtained the large collection of text data. All this retrieve text data are not relevant to the …

Text document clustering and classification using k-means algorithm and neural networks

R Kaur, A Kaur - Indian Journal of Science and …, 2016 - sciresol.s3.us-east-2.amazonaws …
This paper demonstrated the outcomes of the research of a number of general document
clustering and classification methods. Objectives: This research improves the clustering. Its …

A new document clustering algorithm based on association rule

JC Song, JY Shen, QB Song - Proceedings of 2004 …, 2004 - ieeexplore.ieee.org
Owing to the widely application in the fields of information retrieval, document analysis and
information extraction, document cluster analysis has been concerned broadly, and gotten a …

Text categorization based on clustering feature selection

X Zhou, Y Hu, L Guo - Procedia Computer Science, 2014 - Elsevier
In this paper, we discuss a text categorization method based on k-means clustering feature
selection. K-means is classical algorithm for data clustering in text mining, but it is seldom …

Document classification using artificial neural network

K Tripathi, RG Vyas, AK Gupta - Asian Journal of Computer Science and …, 2019 - ajcst.co
The Document classification system is the field of data mining in which the format of data is
based on bag of words (BoW) or document vector model and the task is to build a machine …

Automatic word clustering for text categorization using global information

C Wenliang, C Xingzhi, W Huizhen, Z Jingbo… - … Symposium, AIRS 2004 …, 2005 - Springer
High dimensionality of feature space and short of training documents are the crucial
obstacles for text categorization. In order to overcome these obstacles, this paper presents a …