L Boytsov - Journal of Experimental Algorithmics (JEA), 2011 - dl.acm.org
The primary goal of this article is to survey state-of-the-art indexing methods for approximate dictionary searching. To improve understanding of the field, we introduce a taxonomy that …
Search engines are exceptionally important tools for accessing information in today's world. In satisfying the information needs of millions of users, the effectiveness (the quality of the …
Wavelet trees are widely used in the representation of sequences, permutations, text collections, binary relations, discrete points, and other succinct data structures. We show …
B Ding, AC König - arXiv preprint arXiv:1103.2409, 2011 - arxiv.org
Set intersection is a fundamental operation in information retrieval and database systems. This paper introduces linear space data structures to represent sets such that their …
JS Culpepper, A Moffat - ACM Transactions on Information Systems …, 2010 - dl.acm.org
Conjunctive Boolean queries are a key component of modern information retrieval systems, especially when Web-scale repositories are being searched. A conjunctive query q is …
Y Sun, D Ferizovic, GE Blelloch - Proceedings of the 23rd ACM SIGPLAN …, 2018 - dl.acm.org
Ordered (key-value) maps are an important and widely used data type for large-scale data processing frameworks. Beyond simple search, insertion and deletion, more advanced …
N Ao, F Zhang, D Wu, DS Stones, G Wang… - Proceedings of the …, 2011 - dl.acm.org
Major web search engines answer thousands of queries per second requesting information about billions of web pages. The data sizes and query loads are growing at an exponential …
In this paper, we focus on sorted-set intersection, which is an important part of many algorithms, e.g., RID-list intersection, inverted indexes, and others. In contrast to traditional …
H Cohen, E Porat - Theoretical Computer Science, 2010 - Elsevier
In this paper we present a new problem, the fast set intersection problem, which is to preprocess a collection of sets in order to efficiently report the intersection of any two sets in …