作者
R Dinesh, BS Harish, Devanur S Guru, S Manjunath
发表日期
2009/12
研讨会论文
IICAI
页码范围
2071-2079
简介
In this paper we propose a new method of classifying text documents. Unlike conventional vector space models, the proposed method preserves the sequence of term occurrence in a document. The term sequence is effectively preserved with the help of a novel datastructure called ‘Status Matrix’. Further the corresponding classification technique has been proposed for efficient classification of text documents. To corroborate the efficacy of the proposed representation and classification methods, we have conducted extensive experiments on very large datasets including few benchmark datasets. Also the superiority of the proposed method has been established by comparing the results of the proposed method with that of very well accepted contemporary methods. The experimental results reveal that the proposed method outperforms the existing methods both in terms of representation as well as classification …
引用总数
201020112012201320142015201620172018551331111
学术搜索中的文章