[PDF][PDF] A hybrid document features extraction with clustering based classification framework on large document sets

SA Devi, SS Kumar - International Journal of Advanced …, 2020 - pdfs.semanticscholar.org
As the size of the document collections are increasing day-by-day, finding an essential
document clusters for classification problem is one of the major problem due to high inter …

[PDF][PDF] A cluster based approach with n-grams at word level for document classification

A Khabia, MB Chandak - International Journal of Computer Applications, 2015 - Citeseer
ABSTRACT A breakneck progress of computers and web makes it easier to collect and store
large amount of information in the form of text; eg, reviews, forum postings, blogs, web …

Document clustering analysis with aid of adaptive Jaro Winkler with Jellyfish search clustering algorithm

P Pitchandi, M Balakrishnan - Advances in Engineering Software, 2023 - Elsevier
In this research, document clustering is analyzed with the help of Adaptive Jaro Winkler with
Jellyfish Search Clustering (AJWJSC) algorithm and Chimp Optimization Algorithm (COA) …

A feature selection method for improved document classification

T Basu, CA Murthy - Advanced Data Mining and Applications: 8th …, 2012 - Springer
The aim of text document classification is to automatically group a document to a predefined
class. The main problem of document classification is high dimensionality and sparsity of the …

[PDF][PDF] Document classification model using Web documents for balancing training corpus size per category

SY Park, J Chang, T Kihl - Journal of information and …, 2013 - koreascience.kr
In this paper, we propose a document classification model using Web documents as a part
of the training corpus in order to resolve the imbalance of the training corpus size per …

A fuzzy similarity based concept mining model for text classification

S Puri - arXiv preprint arXiv:1204.2061, 2012 - arxiv.org
Text Classification is a challenging and a red hot field in the current scenario and has great
importance in text categorization applications. A lot of research work has been done in this …

Efficient fuzzy similarity-based text classification with SVM and feature reduction

S Puri - Congress on Intelligent Systems: Proceedings of CIS …, 2021 - Springer
With the generation of enormous data day by day, the need of feature reduction has
tremendously increased in the field of text classification. In this direction, this paper presents …

Text classification toward a scientific forum

W Zhang, X Tang, T Yoshida - Journal of Systems Science and Systems …, 2007 - Springer
Text mining, also known as discovering knowledge from the text, which has emerged as a
possible solution for the current information explosion, refers to the process of extracting non …

Dwsa: An intelligent document structural analysis model for information extraction and data mining

T Yue, Y Li, Z Hu - Electronics, 2021 - mdpi.com
The structure of a document contains rich information such as logical relations in context,
hierarchy, affiliation, dependence, and applicability. It will greatly affect the accuracy of …

Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering

LM Abualigah, AT Khader - The Journal of Supercomputing, 2017 - Springer
The text clustering technique is an appropriate method used to partition a huge amount of
text documents into groups. The documents size affects the text clustering by decreasing its …