Revisiting dictionary‐based compression

P Skibiński, S Grabowski… - Software: Practice and …, 2005 - Wiley Online Library
An attractive way to increase text compression is to replace words with references to a text
dictionary given in advance. Although there exist a few works in this area, they do not fully …

Compression of small text files

J Platoš, V Snášel, E El-Qawasmeh - Advanced engineering informatics, 2008 - Elsevier
This paper suggests a novel compression scheme for small text files. The proposed scheme
depends on Boolean minimization of binary data accompanied with the adoption of Burrows …

n‐Gram‐Based Text Compression

VH Nguyen, HT Nguyen, HN Duong… - Computational …, 2016 - Wiley Online Library
We propose an efficient method for compressing Vietnamese text using n‐gram dictionaries.
It has a significant compression ratio in comparison with those of state‐of‐the‐art methods …

Lempel-Ziv compression of structured text

J Adiego, G Navarro… - … , 2004. Proceedings. DCC …, 2004 - ieeexplore.ieee.org
We describe a novel Lempel-Ziv approach suitable for compressing structured documents,
called LZCS, which takes advantage of redundant information that can appear in the …

Classification of compressed and uncompressed text documents

SNB Bhushan, A Danti - Future Generation Computer Systems, 2018 - Elsevier
Computing the degree of closeness (similarity) between two sets of text documents is one of
the core operations in many text mining applications like text classification, clustering and …

Using structural contexts to compress semistructured text collections

J Adiego, G Navarro, P de la Fuente - Information Processing & …, 2007 - Elsevier
We describe a compression model for semistructured documents, called Structural Contexts
Model (SCM), which takes advantage of the context information usually implicit in the …

Natural language compression on edge-guided text preprocessing

MA Martínez-Prieto, J Adiego, P de la Fuente - Information Sciences, 2011 - Elsevier
This paper presents Edge-Guided (EG), an optimized text preprocessing technique for
compression purposes. It transforms the original text into a word net, which stores all …

[PDF][PDF] A Comparison between English and Arabic text compression

ZM Alasmer, BM Zahran, BA Ayyoub, MA Kanan… - Journal of Contemporary …, 2013 - Citeseer
A Comparison between applying two Techniques that compress document data in both
languages Arabic and English is introduced. In order to compress the data document, two or …

Lempel‐Ziv compression of highly structured documents

J Adiego, G Navarro, P Fuente - Journal of the American …, 2007 - Wiley Online Library
Abstract The authors describe Lempel‐Ziv to Compress Structure (LZCS), a novel Lempel–
Ziv approach suitable for compressing structured documents. LZCS takes advantage of …

An Efficient Compression Scheme for Natural Language Text by Hashing

MA Mahmood, KMA Hasan - SN Computer Science, 2022 - Springer
Data compression means the route towards adjusting, encoding or changing the bit structure
of information so that it requires less space. The fundamental standard behind compression …