Representation of texts into string vectors for text categorization

TH Jo - Journal of Computing Science and Engineering, 2010 - koreascience.kr
In this study, we propose a method for encoding documents into string vectors, instead of
numerical vectors. A traditional approach to text categorization usually requires encoding …

[PDF][PDF] Neural text categorizer for exclusive text categorization

TH Jo - Journal of Information Processing Systems, 2008 - jips-k.org
This research proposes a new neural network for text categorization which uses alternative
representations of documents to numerical vectors. Since the proposed neural network is …

Support tensor machines for text categorization

D Cai, X He, JR Wen, J Han, WY Ma - 2006 - ideals.illinois.edu
We consider the problem of text representation and categorization. Conventionally, a text
document is represented by a vector in high dimensional space. Some learning algorithms …

[PDF][PDF] Text categorization and support vector machines

I Pilászy - Proceedings of the 6th international symposium of …, 2005 - uni-obuda.hu
Text categorization is used to automatically assign previously unseen documents to a
predefined set of categories. This paper gives a short introduction into text categorization …

Research on text categorization based on machine learning

Y Wanjun, S Xiaoguang - 2010 IEEE International Conference …, 2010 - ieeexplore.ieee.org
In recent years, text categorization based on machine learning is a widely used technology
in the field of information retrieval and text mining and has gained many advances. This …

Distributional features for text categorization

XB Xue, ZH Zhou - IEEE Transactions on Knowledge and Data …, 2008 - ieeexplore.ieee.org
Text categorization is the task of assigning predefined categories to natural language text.
With the widely used'bag of words' representation, previous researches usually assign a …

Feature selection using support vector machines

J Brank, M Grobelnik, N Milic-Frayling… - WIT Transactions on …, 2002 - witpress.com
Text categorization is the task of classifying natural language documents into a set of
predefine categories. Documents are typically represented by sparse vectors under the …

Rich document representation and classification: An analysis

M Keikha, A Khonsari, F Oroumchian - Knowledge-Based Systems, 2009 - Elsevier
There are three factors involved in text classification. These are classification model,
similarity measure and document representation model. In this paper, we will focus on …

Classifying text documents by associating terms with text categories

OR Zaiane, ML Antonie - Proceedings of the 13th Australasian database …, 2002 - dl.acm.org
Automatic text categorization has always been an important application and research topic
since the inception of digital documents. Today, text categorization is a necessity due to the …

Class-dependent projection based method for text categorization

L Chen, G Guo, K Wang - Pattern Recognition Letters, 2011 - Elsevier
Text categorization presents unique challenges to traditional classification methods due to
the large number of features inherent in the datasets from real-world applications of text …