Hierarchical approach for scientific document classification

A D'cunha, AK Sen - International Conference on Computing …, 2015 - ieeexplore.ieee.org
Classification is the grouping of information or objects in predefined labeled categories
based on similarities. Exponential growth rates of scientific document collection leads to …

Classification of scientific publications using swarm intelligence

T Ali, NA Sajid, S Asghar, M Ahmed - Proceedings of the Pakistan …, 2013 - paspk.org
Document classification is an important task in data mining. Currently, identifying category
(i.e., topic) of a scientific publication is a manual task. The Association for Computing …

An Improved TF-IDF algorithm based on word frequency distribution information and category distribution information

H Wu, N Yuan - Proceedings of the 3rd International Conference on …, 2018 - dl.acm.org
Traditional TF-IDF (Term Frequency-Inverse Document Frequency) feature weighting
algorithm only uses word frequency information as a measure of the importance of feature …

A technical study on feature ranking techniques and classification algorithms

W Sharif, NA Samsudin, MM Deris, SKA Khalid - J. Eng. Appl. Sci, 2018 - researchgate.net
Since, electronic documents are dramatically increasing therefore document classification
becomes a very important task to organise information automatically. Text documents are a …

An improved tf-idf method for calculating text feature weight

P Li - International Core Journal of Engineering, 2021 - airitilibrary.com
The feature weight algorithm of the text can calculate the classification accuracy of the entire
text. The traditional tf-idf (Term Frequency Inverse Document Frequency) algorithm is only …

Applying Machine Learning Algorithms for News Articles Categorization: Using SVM and kNN with TF-IDF Approach

Kanika, Sangeeta - Smart Computational Strategies: Theoretical and …, 2019 - Springer
News articles categorization is a supervised learning approach in which news articles are
assigned category labels based on likelihood demonstrated by a training set of labeled …

A new enhanced variation of TF-IDF scheme for Arabic text classification

FS Al-Anzi, D AbuZeina - Health, 2016 - iieng.org
Text Classification (TC) is a popular information retrieval (IR) technique that mainly employs
features selection, features reduction, and features weighting techniques. The most common …

Document indexing in text categorization

QR Zhang, L Zhang, SB Dong… - … Conference on Machine …, 2005 - ieeexplore.ieee.org
Aiming at the characteristic of text categorization, this paper proposes an improved method
of computing term weights, tfidfie, based on the traditional tfidf function that is generally used …

An improved approach to terms weighting in text classification

Z Ma, J Feng, L Chen, X Hu… - … Conference on Computer …, 2011 - ieeexplore.ieee.org
Most of traditional text classification methods utilize term frequency (tf) and inverse
document frequency (idf) for representing importance of terms and computing weighting of …

A comparative study on feature weight in text categorization

ZH Deng, SW Tang, DQ Yang, MZLY Li… - … Web Technologies and …, 2004 - Springer
Text Categorization is the process of automatically assigning predefined categories to free
text documents. Feature weight, which calculates feature (term) values in documents, is one …