作者
Jérémy Barbay, Alejandro López-Ortiz, Tyler Lu, Alejandro Salinger
发表日期
2010/1/5
期刊
Journal of Experimental Algorithmics (JEA)
卷号
14
页码范围
3.7-3.24
出版商
ACM
简介
The intersection of large ordered sets is a common problem in the context of the evaluation of boolean queries to a search engine. In this article, we propose several improved algorithms for computing the intersection of sorted arrays, and in particular for searching sorted arrays in the intersection context. We perform an experimental comparison with the algorithms from the previous studies from Demaine, López-Ortiz, and Munro [ALENEX 2001] and from Baeza-Yates and Salinger [SPIRE 2005]; in addition, we implement and test the intersection algorithm from Barbay and Kenyon [SODA 2002] and its randomized variant [SAGA 2003]. We consider both the random data set from Baeza-Yates and Salinger, the Google queries used by Demaine et al., a corpus provided by Google, and a larger corpus from the TREC Terabyte 2006 efficiency query stream, along with its own query log. We measure the performance both …
引用总数
20102011201220132014201520162017201820192020202120222023557119979433221
学术搜索中的文章
J Barbay, A López-Ortiz, T Lu, A Salinger - Journal of Experimental Algorithmics (JEA), 2010