A novel feature selection score for text categorization

S Eyheramendy, D Madigan - Proceedings of the Workshop on Feature …, 2005 - siam.org
This paper proposes a new feature selection score for text classification. The value that this
score assigns to each feature has an appealing Bayesian interpretation, being the posterior …

Classification of text documents

YH Li, AK Jain - The Computer Journal, 1998 - academic.oup.com
The exponential growth of the internet has led to a great deal of interest in developing useful
and efficient tools and software to assist users in searching the Web. Document retrieval …

A review of machine learning algorithms for text-documents classification

A Khan, B Baharudin, LH Lee, K Khan - Journal of advances in …, 2010 - academia.edu
With the increasing availability of electronic documents and the rapid growth of the World
Wide Web, the task of automatic categorization of documents became the key method for …

A simple feature selection method for text classification

P Soucy, GW Mineau - Proceedings of the 17th international joint …, 2001 - dl.acm.org
In text classification most techniques use bag-of-words to represent documents. The main
problem is to identify what words are best suited to classify the documents in such a way as …

Survey on supervised machine learning techniques for automatic text classification

AI Kadhim - Artificial intelligence review, 2019 - Springer
Supervised machine learning studies have recently gained significance because of the
increasing availability of electronic documents from different resources …

A novel feature selection technique for text classification

DS Guru, M Ali, M Suhil - Emerging Technologies in Data Mining and …, 2019 - Springer
In this paper, a new feature selection technique called Term-Class Weight-Inverse-Class
Frequency is proposed for the purpose of text classification. The technique is based on …

Supervised two-step feature extraction for structured representation of text data

O Háva, M Skrbek, P Kordík - Simulation Modelling Practice and Theory, 2013 - Elsevier
The training data matrix used for classifying text documents into multiple categories is
characterized by a large number of dimensions, while the number of manually classified …

Document classification using various classification algorithms: a survey

B Kaur, G Bathla - Int J Fut Revol Comput Sci Commun Eng, 2018 - academia.edu
Text classification is used to classify documents of similar types. Text classification can
also be performed under supervision, i.e., it is a supervised learning technique. Text classification …

Benchmarking text collections for classification and clustering tasks

RG Rossi, RM Marcacini, SO Rezende - 2013 - repositorio.usp.br
Several text mining techniques have been proposed to deal with the huge number of textual
documents that are available and that have been published nowadays. Mainly classification …

A novel framework for termset selection and weighting in binary text classification

D Badawi, H Altınçay - Engineering Applications of Artificial Intelligence, 2014 - Elsevier
This study presents a new framework for termset selection and weighting. The proposed
framework is based on employing the joint occurrence statistics of pairs of terms for termset …