[PDF][PDF] A Comparative Study on Chinese Text Categorization Methods.

J He, AH Tan, CL Tan - PRICAI workshop on text and web mining, 2000 - researchgate.net
This paper reports our comparative evaluation of three machine learning methods on
Chinese text categorization. Whereas a wide range of methods have been applied to …

Semantic text classification: A survey of past and recent advances

B Altınel, MC Ganiz - Information Processing & Management, 2018 - Elsevier
Automatic text classification is the task of organizing documents into pre-determined classes,
generally using machine learning algorithms. Generally speaking, it is one of the most …

[引用][C] Research on the algorithm of feature selection based on Gini index for text categorization.

W Shang, H Huang, Y Liu, Y Lin, Y Qu, H Dong - Jisuanji Yanjiu yu Fazhan(Computer …, 2006

On entropy-based term weighting schemes for text categorization

T Wang, Y Cai, H Leung, RYK Lau, H Xie… - Knowledge and Information …, 2021 - Springer
Abstract In text categorization, Vector Space Model (VSM) has been widely used for
representing documents, in which a document is represented by a vector of terms. Since …

A Review on Comparison of Machine Learning Algorithms for Text Classification

M Dhingra, D Dhabliya, MK Dubey… - … and Informatics (IC3I …, 2022 - ieeexplore.ieee.org
The majority of the data is preserved as text (about 75%), hence It is believed that text
mining has a significant commercial potential. Unstructured texts continue to be the most …

Automatic text categorization: Marathi documents

JJ Patil, N Bogiri - 2015 International Conference on Energy …, 2015 - ieeexplore.ieee.org
Information technology generated huge data on the internet. Initially this data is mainly in
English language so majority of data mining research work is on the English text documents …

Abstract feature extraction for text classification

G BİRİCİK, B Diri, AC SÖNMEZ - Turkish Journal of Electrical …, 2012 - journals.tubitak.gov.tr
Feature selection and extraction are frequently used solutions to overcome the curse of
dimensionality in text classification problems. We introduce an extraction method that …

[引用][C] Similarity-based techniques for text document classification

SS Karman, N Ramaraj - Int. J. SoftComput, 2008

Text categorization using neural networks initialized with decision trees

N Remeikis, I Skučas, V Melninkaitė - Informatica, 2004 - content.iospress.com
Text categorization–the assignment of natural language documents to one or more
predefined categories based on their semantic content–is an important component in many …

Research on Chinese text classification based on Word2vec

ZT Yang, J Zheng - 2016 2nd IEEE International Conference on …, 2016 - ieeexplore.ieee.org
The set of features which the traditional feature selection algorithm of chi-square selected is
not complete. This causes the low performance for the final text classification. Therefore, this …