Authors
S Akshay, K Nayana, S Karthika
Publication date
2015
Journal
International Journal of Applied Engineering Research
Volume
10
Issue
10
ISSN
0973-4562
Publisher
Research India Publications
Description
The Internet is a pool of information containing billions of text documents, which are stored in compressed format. The literature offers many text classification algorithms, but they work on uncompressed text documents. Because web pages contain text data stored in compressed format, the documents must be restored to their original format before any data mining activity, and this decompression consumes additional computational time. This work therefore presents a study of different text classification and clustering algorithms and compares them in the compressed domain. Various methods for representing text in the compressed domain are explained, and experiments are conducted on the LZW method for comparison. Different classification and clustering algorithms are also discussed, and a comparative analysis of all these methods is presented.
Total citations
[Chart: citations per year, 2017 to 2023]
Scholar articles
S Akshay, K Nayana, S Karthika - … International Journal of Applied Engineering Research, 2015