[PDF][PDF] Preprocessing for PPM: compressing UTF-8 encoded natural language text

WJ Teahan, KM Alhawiti - International Journal of Computer Science …, 2015 - academia.edu
In this paper, several new universal preprocessing techniques are described to improve
Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text …

[图书][B] Grammar-based preprocessing for PPM compression and classification

N Aljehane - 2018 - search.proquest.com
The aim of this study is to investigate the efficiency of novel methods using context-free
grammars and Prediction by Partial Matching (PPM) in order to build and evaluate the …

[PDF][PDF] Grammar based pre-processing for PPM

WJ Teahan, NO Aljehane - Int. J. Comput. Sci. Inf. Secur, 2017 - researchgate.net
In this paper, we apply grammar-based pre-processing prior to using the Prediction by
Partial Matching (PPM) compression algorithm. This achieves significantly better …

[图书][B] Categorisation of Arabic Twitter Text

MHR Altamimi - 2020 - search.proquest.com
The shortage of Arabic language resources in the field of corpus linguistics compared to
other popular languages such as English, Chinese and Spanish inspired this work. The …

[图书][B] Compression-based Methods for the Automatic Cryptanalysis of Classical Ciphers

NR Al-Kazaz - 2019 - search.proquest.com
The study documented in this thesis investigates the effectiveness of compression in the
field of cryptanalysis, specifically for the automatic cryptanalysis of classical ciphers, initially …

[引用][C] PREPROCESSING FOR PPM: COMPRESSING UTF-8