A new text clustering method based on Huffman encoding algorithm

M Muntean, L Căbulea, H Vălean - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
Clustering of text data is a widely studied data mining problem and has a number of
applications such as spam detection, document organization and indexing, IP-address …

XML compression improvements based on the clustering of elements

P Hruška, J Martinovič, J Dvorský… - … Systems and Industrial …, 2010 - ieeexplore.ieee.org
Pavel Hruška, Jan Martinovič, Jiří Dvorský …

Simple Rules for Syllabification of Arabic Texts

H Soori, J Platos, V Snasel, H Abdulla - … , Ostrava, Czech Republic, July 7-9 …, 2011 - Springer
The Arabic language is the sixth most used language in the world today. It is also used by
the United Nations. Moreover, the Arabic alphabet is the second most widely used alphabet …

Improvement of Text Compression Using Subset of Words

J Platos - Advanced Science Letters, 2014 - ingentaconnect.com
This paper describes a novel approach to text compression based on a combination of
the character-based and word-based approaches. The new approach uses a subset of words for improvement of …

Document Compression Improvements Based on Data Clustering

J Dvorský, J Martinovic, J Platoš… - Web Intelligence and …, 2010 - books.google.com
The modern information society produces immense quantities of textual information. Storing
text effectively and searching necessary information in stored texts are the tasks for …

[CITATION][C] Data Compression Approach for Plagiarism Detection

HKH Soori - 2016 - dspace.vsb.cz
In our digital era, the need for plagiarism detection tools is growing with the tremendous
number of documents produced on a daily basis in and outside academia in all fields of …