[图书][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Federated search techniques: an overview of the trends and state of the art

A Garba, S Wu, S Khalid - Knowledge and Information Systems, 2023 - Springer
Conventional search engines, such as Bing, Baidu, and Google, offer a convenient way for
users to seek information on the web. However, with all the benefits they provide, one major …

Federated search

M Shokouhi, L Si - Foundations and Trends® in Information …, 2011 - nowpublishers.com
Federated search (federated information retrieval or distributed information retrieval) is a
technique for searching multiple text collections simultaneously. Queries are submitted to a …

Central-rank-based collection selection in uncooperative distributed information retrieval

M Shokouhi - European conference on information retrieval, 2007 - Springer
Collection selection is one of the key problems in distributed information retrieval. Due to
resource constraints it is not usually feasible to search all collections in response to a query …

Challenges on distributed web retrieval

R Baeza-Yates, C Castillo, F Junqueira… - 2007 IEEE 23rd …, 2006 - ieeexplore.ieee.org
In the ocean of Web data, Web search engines are the primary way to access content. As the
data is on the order of petabytes, current search engines are very large centralized systems …

SUSHI: Scoring scaled samples for server selection

P Thomas, M Shokouhi - Proceedings of the 32nd international ACM …, 2009 - dl.acm.org
Modern techniques for distributed information retrieval use a set of documents sampled from
each server, but these samples have been underutilised in server selection. We describe a …

Unbiased estimation of size and other aggregates over hidden web databases

A Dasgupta, X Jin, B Jewell, N Zhang… - Proceedings of the 2010 …, 2010 - dl.acm.org
Many websites provide restrictive form-like interfaces which allow users to execute search
queries on the underlying hidden databases. In this paper, we consider the problem of …

Aggregated search

J Arguello - Foundations and Trends® in Information …, 2017 - nowpublishers.com
The goal of aggregated search is to provide integrated search across multiple
heterogeneous search services in a unified interface—a single query box and a common …

Robust result merging using sample-based score estimates

M Shokouhi, J Zobel - ACM Transactions on Information Systems (TOIS), 2009 - dl.acm.org
In federated information retrieval, a query is routed to multiple collections and a single
answer list is constructed by combining the results. Such metasearch provides a mechanism …

[PDF][PDF] 基于属性相关度的Web 数据库大小估算方法

凌妍妍, 孟小峰, 刘伟 - 2008 - Citeseer
提出了一种基于词频统计的方法以估算Web 数据库的规模. 通过分析Web
数据库查询接口中属性之间的相关度来获取某个属性上的一组随机样本; 并对该属性分别提交由 …