Analyzing imbalance among homogeneous index servers in a web search system

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu

Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Cited by 20949 · Related articles · All 23 versions

Challenges on distributed web retrieval

R Baeza-Yates, C Castillo, F Junqueira… - 2007 IEEE 23rd …, 2006 - ieeexplore.ieee.org

In the ocean of Web data, Web search engines are the primary way to access content. As the
data is on the order of petabytes, current search engines are very large centralized systems …

Cited by 148 · Related articles · All 11 versions

Parallel implementation and performance of fastDNAml: a program for maximum likelihood phylogenetic inference

CA Stewart, D Hart, DK Berry, GJ Olsen… - Proceedings of the …, 2001 - dl.acm.org

This paper describes the parallel implementation of fastDNAml, a program for the maximum
likelihood inference of phylogenetic trees from DNA sequence data. Mathematical means of …

Cited by 136 · Related articles · All 12 versions

Scalability challenges in web search engines

BB Cambazoglu, R Baeza-Yates - Advanced topics in information retrieval, 2011 - Springer

Continuous growth of the Web and user bases forces web search engine companies to
make costly investments on very large compute infrastructures. The scalability of these …

Cited by 57 · Related articles · All 8 versions

A term-based inverted index partitioning model for efficient distributed query processing

BB Cambazoglu, E Kayaaslan, S Jonassen… - ACM Transactions on …, 2013 - dl.acm.org

In a shared-nothing, distributed text retrieval system, queries are processed over an inverted
index that is partitioned among a number of index servers. In practice, the index is either …

Cited by 49 · Related articles · All 10 versions

On the feasibility of multi-site web search engines

R Baeza-Yates, A Gionis, F Junqueira… - Proceedings of the 18th …, 2009 - dl.acm.org

Web search engines are often implemented as centralized systems. Designing and
implementing a Web search engine in a distributed environment is a challenging …

Cited by 66 · Related articles · All 2 versions

Optimal aggregation policy for reducing tail latency of web search

JM Yun, Y He, S Elnikety, S Ren - … of the 38th International ACM SIGIR …, 2015 - dl.acm.org

A web search engine often employs partition-aggregate architecture, where an aggregator
propagates a user query to all index serving nodes (ISNs) and collects the responses from …

Cited by 36 · Related articles · All 7 versions

Efficient distributed selective search

Y Kim, J Callan, JS Culpepper, A Moffat - Information Retrieval Journal, 2017 - Springer

Simulation and analysis have shown that selective search can reduce the cost of large-scale
distributed information retrieval. By partitioning the collection into small topical shards, and …

Cited by 29 · Related articles · All 15 versions

Tuning the capacity of search engines: Load-driven routing and incremental caching to reduce and balance the load

D Puppin, F Silvestri, R Perego… - ACM Transactions on …, 2010 - dl.acm.org

This article introduces an architecture for a document-partitioned search engine, based on a
novel approach combining collection selection and load balancing, called load-driven …

Cited by 44 · Related articles · All 8 versions

[BOOK][B] Advanced topics in information retrieval

M Melucci, R Baeza-Yates - 2011 - books.google.com

Information retrieval is the science concerned with the effective and efficient retrieval of
documents starting from their semantic content. It is employed to fulfill some information …

Cited by 30 · Related articles · All 7 versions
