Wordbased text compression

A Moffat - Software: Practice and Experience, 1989 - Wiley Online Library
text at a level higher than character by character. Bentley et uZ.~ have described a word-based
‘move-to-thefront’ (MTF) compression … their scheme can represent English text in 3 to 4 …

[PDF][PDF] Constructing Word-Based Text Compression Algorithms.

RN Horspool, GV Cormack - Data compression conference, 1992 - webhome.cs.uvic.ca
… the problem of generalizing a compression algorithm to be word-based, then particular …
word-based algorithm for text file compression. And of the possibilities considered, word-based

Word-based text compression

J Platos, J Dvorsky - arXiv preprint arXiv:0804.3680, 2008 - arxiv.org
wordbased compression methods based on Huffman encoding, LZW or BWT were tested.
This paper describes word-based compressionwordbased or classic compression algorithms. …

Word-based text compression using the Burrows–Wheeler transform

A Moffat, RYK Isal - Information processing & management, 2005 - Elsevier
… 1 lists some text files extracted from the Calgary corpus, and the standard and large Canterbury
corpora. We focus on text files because of the very nature of word-based compression. In …

Development of word-based text compression algorithm for Indonesian language document

A Sinaga, H Nugroho - 2015 3rd International Conference on …, 2015 - ieeexplore.ieee.org
… to modify the wordbased text compression algorithm for Indonesian … (Word Based Lempel
Ziv Welch) Compression algorithm [1]. The main objective is to increase the compression ratio …

Word-based compression methods for large text documents

J Dvorský, J Pokorný, V Snásel - Data Compression Conference, 1999 - computer.org
… In this article we present a new compression method, called WLZW, which is a word-based
Hu_Word [3] compression algorithm. Due to special using WLZW in text databases, some its …

On compression-based text classification

Y Marton, N Wu, L Hellerstein - … Retrieval: 27th European Conference on IR …, 2005 - Springer
text classification methods are word-based; they treat a text document as a collection of words
(or stems). In contrast, nearly all research on compression-… over word-based methods, in …

A study on wordbased and integral‐bit Chinese text compression algorithms

KS Cheng, GH Young, KF Wong - Journal of the American …, 1999 - Wiley Online Library
compressor, COMP-2, it demonstrates a faster compression … performance of the compression
algorithm (compression ratio) … retrieval to design word-based compression algorithms for …

Word-based dynamic algorithms for data compression

J Jiang, S Jones - IEE Proceedings I (Communications, Speech and …, 1992 - IET
compression algorithms developed so far have include text … new word-based dynamic
Lempel-Ziv compression algorithm (… for compressing text, but which still can be used to compress

Word-Based Compression Methods and Indexing for Text Retrieval Systems.

J Dvorský, J Pokorný, V Snasel - ADBIS, 1999 - Springer
… In this article we present a new compression method, called WLZW, which is a word-based
in the HuffWord compression algorithm. The algorithm is two-phase, the compression ratio …