[PDF] Feature extraction based text classification using k-nearest neighbor algorithm

M Azam, T Ahmed, F Sabah… - IJCSNS Int. J. Comput …, 2018 - researchgate.net
Scientific publications have been increasing enormously, and with this increase, classification of
scientific publications is becoming a challenging task. The core objective of this research is to …

[PDF] A topical document clustering method

S Zhao, T Liu, S Li - Journal of Chinese information processing, 2007 - cips-cl.org
Few of the existing document clustering methods can detect or describe document topics
properly, which makes it difficult to conduct clustering based on topics. In this paper, we …

[CITATION] A two-stage text feature selection algorithm for improving text classification

P Ashokkumar, SG Shankar, G Srivastava… - ACM Transactions on Asian …, 2021

Combined chi-square with k-means for document clustering

AI Kadhim, AK Jassim - IOP Conference Series: Materials …, 2021 - iopscience.iop.org
Currently, dynamic websites have grown, with thousands of documents
associated with a category topic available. Most of the website documents are unstructured …

[BOOK] Feature selection and enhanced krill herd algorithm for text document clustering

LMQ Abualigah - 2019 - Springer
1.1 Background. With the growth of the amount of text information on Internet web pages and
modern applications, in general, interest in the text analysis area has increased to facilitate …

An iterative hybrid filter-wrapper approach to feature selection for document clustering

MA Jashki, M Makki, E Bagheri… - Advances in Artificial …, 2009 - Springer
The manipulation of large-scale document data sets often involves the processing of a
wealth of features that correspond with the available terms in the document space. The …

Classifying text documents by associating terms with text categories

OR Zaiane, ML Antonie - Proceedings of the 13th Australasian database …, 2002 - dl.acm.org
Automatic text categorization has always been an important application and research topic
since the inception of digital documents. Today, text categorization is a necessity due to the …

Text document classification with PCA and one-class SVM

B Shravan Kumar, V Ravi - … of the 5th International Conference on …, 2017 - Springer
We propose a document classifier based on principal component analysis (PCA) and
one-class support vector machine (OCSVM), where PCA helps achieve dimensionality reduction …

New methods for text categorization based on a new feature selection method and a new similarity measure between documents

LW Lee, SM Chen - … Conference on Industrial, Engineering and Other …, 2006 - Springer
In this paper, we present a new feature selection method based on document frequencies
and statistical values. We also present a new similarity measure to calculate the degree of …

[PDF] Document clustering: a detailed review

N Shah, S Mahajan - International Journal of Applied Information …, 2012 - academia.edu
Document clustering is automatic organization of documents into clusters so that documents
within a cluster have high similarity in comparison to documents in other clusters. It has been …