Opportunistic data structures with applications

P Ferragina, G Manzini - Proceedings 41st annual symposium …, 2000 - ieeexplore.ieee.org
We address the issue of compressing and indexing data. We devise a data structure whose
space occupancy is a function of the entropy of the underlying data set. We call the data …

Compressed suffix arrays and suffix trees with applications to text indexing and string matching

R Grossi, JS Vitter - Proceedings of the thirty-second annual ACM …, 2000 - dl.acm.org
The proliferation of online text, such as on the World Wide Web and in databases, motivates
the need for space-efficient index methods that support fast search. Consider a text T of n …

A review on document image analysis techniques directly in the compressed domain

M Javed, P Nagabhushan, BB Chaudhuri - Artificial Intelligence Review, 2018 - Springer
The rapid growth of digital libraries, e-governance, and internet based applications has
caused an exponential escalation in the volume of 'Big-data'particularly due to texts, images …

Fast randomized singular value thresholding for nuclear norm minimization

TH Oh, Y Matsushita, YW Tai… - Proceedings of the …, 2015 - openaccess.thecvf.com
Rank minimization problem can be boiled down to either Nuclear Norm Minimization (NNM)
or Weighted NNM (WNNM) problem. The problems related to NNM (or WNNM) can be …

Compressive biological sequence analysis and archival in the era of high-throughput sequencing technologies

R Giancarlo, SE Rombo, F Utro - Briefings in bioinformatics, 2014 - academic.oup.com
High-throughput sequencing technologies produce large collections of data, mainly DNA
sequences with additional information, requiring the design of efficient and effective …

[PDF][PDF] Byte pair encoding: A text compression scheme that accelerates pattern matching

Y Shibata, T Kida, S Fukamachi, M Takeda… - 1999 - researchgate.net
Byte pair encoding (BPE) is a simple universal text compression scheme. Decompression is
very fast and requires small work space. Moreover, it is easy to decompress an arbitrary part …

Fast and flexible word searching on compressed text

E Silva de Moura, G Navarro, N Ziviani… - ACM Transactions on …, 2000 - dl.acm.org
We present a fast compression technique for natural language texts. The novelties are that
(1) decompression of arbitrary portions of the text can be done very efficiently,(2) exact …

[PDF][PDF] String matching in Lempel-Ziv compressed strings

M Farach, M Thorup - Proceedings of the twenty-seventh annual ACM …, 1995 - dl.acm.org
String matching and Compression are two widely studied areas of computer science. The
theory of string matching has a long association with compression algorithms. Data …

An experimental study of an opportunistic index.

P Ferragina, G Manzini - SODA, 2001 - books.google.com
The size of electronic data is currently growing at a faster rate than computer memory and
disk storage capacities. For this reason compression appears always as an attractive choice …

Algorithmics on SLP-compressed strings: A survey

M Lohrey - Groups-Complexity-Cryptology, 2012 - degruyter.com
Results on algorithmic problems on strings that are given in a compressed form via straight-
line programs are surveyed. A straight-line program is a context-free grammar that …