Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences

G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …

Top-k Document Retrieval in Optimal Time and Linear Space

G Navarro, Y Nekrich - Proceedings of the twenty-third annual ACM-SIAM …, 2012 - SIAM
We describe a data structure that uses O (n)-word space and reports k most relevant
documents that contain a query pattern P in optimal O (| P|+ k) time. Our construction …

Colored range queries and document retrieval

T Gagie, J Kärkkäinen, G Navarro, SJ Puglisi - Theoretical Computer …, 2013 - Elsevier
Colored range queries are a well-studied topic in computational geometry and database
research that, in the past decade, have found exciting applications in information retrieval. In …

Inverted indexes for phrases and strings

M Patil, SV Thankachan, R Shah, WK Hon… - Proceedings of the 34th …, 2011 - dl.acm.org
Inverted indexes are the most fundamental and widely used data structures in information
retrieval. For each unique word occurring in a document collection, the inverted index stores …

Wavelet trees: A survey

C Makris - Computer Science and Information Systems, 2012 - doiserbia.nb.rs
The topic of this paper is the exploration of the various characteristics of the wavelet tree
data structure, a data structure that was initially proposed for text compression applications …

Space-Efficient Frameworks for Top-k String Retrieval

WK Hon, R Shah, SV Thankachan… - Journal of the ACM (JACM), 2014 - dl.acm.org
The inverted index is the backbone of modern web search engines. For each word in a
collection of web documents, the index records the list of documents where this word occurs …

[HTML][HTML] Improved compressed indexes for full-text document retrieval

D Belazzougui, G Navarro, D Valenzuela - Journal of Discrete Algorithms, 2013 - Elsevier
We give new space/time tradeoffs for compressed indexes that answer document retrieval
queries on general sequences. On a collection of D documents of total length n, current …

[HTML][HTML] On non-progressive spread of influence through social networks

MA Fazli, M Ghodsi, J Habibi, P Jalaly… - Theoretical Computer …, 2014 - Elsevier
The spread of influence in social networks is studied in two main categories: progressive
models and non-progressive models (see, eg, the seminal work of Kempe et al.[8]). While …

Time-Optimal Top- Document Retrieval

G Navarro, Y Nekrich - SIAM Journal on Computing, 2017 - SIAM
Let \mathcalD be a collection of D documents, which are strings over an alphabet of size σ,
of total length n. We describe a data structure that uses linear space and reports k most …

Efficient range searching for categorical and plain data

Y Nekrich - ACM Transactions on Database Systems (TODS), 2014 - dl.acm.org
In the orthogonal range-searching problem, we store a set of input points S in a data
structure; the answer to a query Q is a piece of information about points in Q∩ S, for …