[图书][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Challenges on distributed web retrieval

R Baeza-Yates, C Castillo, F Junqueira… - 2007 IEEE 23rd …, 2006 - ieeexplore.ieee.org
In the ocean of Web data, Web search engines are the primary way to access content. As the
data is on the order of petabytes, current search engines are very large centralized systems …

Parallel implementation and performance of fastDNAml: a program for maximum likelihood phylogenetic inference

CA Stewart, D Hart, DK Berry, GJ Olsen… - Proceedings of the …, 2001 - dl.acm.org
This paper describes the parallel implementation of fastDNAml, a program for the maximum
likelihood inference of phylogenetic trees from DNA sequence data. Mathematical means of …

Scalability challenges in web search engines

BB Cambazoglu, R Baeza-Yates - Advanced topics in information retrieval, 2011 - Springer
Continuous growth of the Web and user bases forces web search engine companies to
make costly investments on very large compute infrastructures. The scalability of these …

A term-based inverted index partitioning model for efficient distributed query processing

BB Cambazoglu, E Kayaaslan, S Jonassen… - ACM Transactions on …, 2013 - dl.acm.org
In a shared-nothing, distributed text retrieval system, queries are processed over an inverted
index that is partitioned among a number of index servers. In practice, the index is either …

On the feasibility of multi-site web search engines

R Baeza-Yates, A Gionis, F Junqueira… - Proceedings of the 18th …, 2009 - dl.acm.org
Web search engines are often implemented as centralized systems. Designing and
implementing a Web search engine in a distributed environment is a challenging …

Optimal aggregation policy for reducing tail latency of web search

JM Yun, Y He, S Elnikety, S Ren - … of the 38th International ACM SIGIR …, 2015 - dl.acm.org
A web search engine often employs partition-aggregate architecture, where an aggregator
propagates a user query to all index serving nodes (ISNs) and collects the responses from …

Efficient distributed selective search

Y Kim, J Callan, JS Culpepper, A Moffat - Information Retrieval Journal, 2017 - Springer
Simulation and analysis have shown that selective search can reduce the cost of large-scale
distributed information retrieval. By partitioning the collection into small topical shards, and …

Tuning the capacity of search engines: Load-driven routing and incremental caching to reduce and balance the load

D Puppin, F Silvestri, R Perego… - ACM Transactions on …, 2010 - dl.acm.org
This article introduces an architecture for a document-partitioned search engine, based on a
novel approach combining collection selection and load balancing, called load-driven …

[图书][B] Advanced topics in information retrieval

M Melucci, R Baeza-Yates - 2011 - books.google.com
Information retrieval is the science concerned with the effective and efficient retrieval of
documents starting from their semantic content. It is employed to fulfill some information …