[图书][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Estimating search engine index size variability: a 9-year longitudinal study

A Van den Bosch, T Bogers, M De Kunder - Scientometrics, 2016 - Springer
One of the determining factors of the quality of Web search engines is the size of their index.
In addition to its influence on search result quality, the size of the indexed Web can also tell …

Federated search

M Shokouhi, L Si - Foundations and Trends® in Information …, 2011 - nowpublishers.com
Federated search (federated information retrieval or distributed information retrieval) is a
technique for searching multiple text collections simultaneously. Queries are submitted to a …

Random sampling from a search engine's index

Z Bar-Yossef, M Gurevich - Journal of the ACM (JACM), 2008 - dl.acm.org
We revisit a problem introduced by Bharat and Broder almost a decade ago: How to sample
random pages from the corpus of documents indexed by a search engine, using only the …

On random walk based graph sampling

RH Li, JX Yu, L Qin, R Mao, T Jin - 2015 IEEE 31st international …, 2015 - ieeexplore.ieee.org
Random walk based graph sampling has been recognized as a fundamental technique to
collect uniform node samples from a large graph. In this paper, we first present a …

An overview of Web search evaluation methods

R Ali, MMS Beg - Computers & Electrical Engineering, 2011 - Elsevier
Web search evaluation is the process of measuring the effectiveness of a Web search
system. Such an evaluation helps in identifying the most effective one and helps the users to …

Estimating sizes of social networks via biased sampling

L Katzir, E Liberty, O Somekh - … of the 20th international conference on …, 2011 - dl.acm.org
Online social networks have become very popular in recent years and their number of users
is already measured in many hundreds of millions. For various commercial and sociological …

Estimating clustering coefficients and size of social networks via random walk

SJ Hardiman, L Katzir - … of the 22nd international conference on World …, 2013 - dl.acm.org
Online social networks have become a major force in today's society and economy. The
largest of today's social networks may have hundreds of millions to more than a billion users …

Crowdsourced enumeration queries

B Trushkowsky, T Kraska, MJ Franklin… - 2013 IEEE 29th …, 2013 - ieeexplore.ieee.org
Hybrid human/computer database systems promise to greatly expand the usefulness of
query processing by incorporating the crowd for data gathering and other tasks. Such …

On estimating the average degree

A Dasgupta, R Kumar, T Sarlos - … of the 23rd international conference on …, 2014 - dl.acm.org
Networks are characterized by nodes and edges. While there has been a spate of recent
work on estimating the number of nodes in a network, the edge-estimation question appears …