Rich document representation and classification: An analysis

M Keikha, A Khonsari, F Oroumchian - Knowledge-Based Systems, 2009 - Elsevier
There are three factors involved in text classification. These are classification model,
similarity measure and document representation model. In this paper, we will focus on …

Representation and classification of text documents: A brief review

BS Harish, DS Guru, S Manjunath - IJCA, Special Issue on …, 2010 - researchgate.net
Text classification is one of the important research issues in the field of text mining, where
the documents are classified with supervised knowledge. In literature we can find many text …

Text Categorization: A comparison of classifiers, feature selection metrics and document representation

F Peleja, GP Lopes, J Silva - Proceedings of the 15th Portuguese …, 2011 - researchgate.net
In this paper, we compare several aspects related to automatic text categorization which
include document representation, feature selection, three classifiers, and their application to …

On combining classifier mass functions for text categorization

DA Bell, JW Guan, Y Bi - IEEE transactions on knowledge and …, 2005 - ieeexplore.ieee.org
Experience shows that different text classification methods can give different results. We look
here at a way of combining the results of two or more different classification methods using …

Semantic text classification: A survey of past and recent advances

B Altınel, MC Ganiz - Information Processing & Management, 2018 - Elsevier
Automatic text classification is the task of organizing documents into pre-determined classes,
generally using machine learning algorithms. Generally speaking, it is one of the most …

Text categorization: An experiment using phrases

M Kongovi, JC Guzman, V Dasigi - … BCS-IRSG European Colloquium on IR …, 2002 - Springer
Typical text classifiers learn from example and training documents that have been manually
categorized. In this research, our experiment dealt with the classification of news wire …

Text classification improved through multigram models

D Shen, JT Sun, Q Yang, Z Chen - Proceedings of the 15th ACM …, 2006 - dl.acm.org
Classification algorithms and document representation approaches are two key elements for
a successful document classification system. In the past, much work has been conducted to …

Recording word position information for improved document categorization

P Gawrysiak, L Gancarz… - Proceedings of the …, 2002 - softlab.ece.ntua.gr
In this paper, which is a report from work in progress, we briefly present the new document
representation that could be used in classic text mining applications, such as document …

An experimental evaluation of OCR text representations for learning document classifiers

M Junker, R Hoch - International Journal on Document Analysis and …, 1998 - Springer
In the literature, many feature types are proposed for document classification. However, an
extensive and systematic evaluation of the various approaches has not yet been done. In …

An evaluation of bag-of-concepts representations in automatic text classification

O Täckström - Recall, 2005 - Citeseer
Automatic text classification is the process of automatically classifying text documents into
pre-defined document classes. Traditionally, documents are represented in the so-called …