Analyzing imbalance among homogeneous index servers in a web search system

CS Badue, R Baeza-Yates, B Ribeiro-Neto… - Information processing & …, 2007 - Elsevier
The performance of parallel query processing in a cluster of index servers is crucial for
modern web search systems. In such a scenario, the response time basically depends on …

Mining query logs to optimize index partitioning in parallel web search engines

C Lucchese, S Orlando, R Perego, F Silvestri - 2nd International ICST …, 2010 - eudl.eu
Large-scale Parallel Web Search Engines (WSEs) need to adopt a strategy for
partitioning the inverted index among a set of parallel server nodes. In this paper we are …

Basic issues on the processing of web queries

C Badue, R Barbosa, P Golgher… - Proceedings of the 28th …, 2005 - dl.acm.org
In this paper we study three basic and key issues related to Web query processing: load
balance, broker behavior, and performance by individual index servers. Our study, while …

[BOOK][B] High performance issues in web search engines: Algorithms and techniques

F Silvestri - 2004 - Citeseer
For hundreds of years, mankind has organized information in order to make it more
accessible to others. The latest medium born to provide information globally is the Internet …

Models and algorithms for parallel text retrieval

BB Cambazoğlu - 2006 - search.proquest.com
In the last decade, search engines became an integral part of our lives. The current state-of-
the-art in search engine technology relies on parallel text retrieval. Basically, a parallel text …

Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

BB Cambazoglu, A Catal, C Aykanat - International Symposium on …, 2006 - Springer
Shared-nothing, parallel text retrieval systems require an inverted index, representing a
document collection, to be partitioned among a number of processors. In general, the index …

Hybrid query scheduling for a replicated search engine

A Freire, C Macdonald, N Tonellotto, I Ounis… - Advances in Information …, 2013 - Springer
Search engines use replication and distribution of large indices across many query servers
to achieve efficient retrieval. Under high query load, queries can be scheduled to replicas …

Load-balancing and caching for collection selection architectures

D Puppin, F Silvestri, R Perego… - 2nd International ICST …, 2010 - eudl.eu
To address the rapid growth of the Internet, modern Web search engines have to adopt
distributed organizations, where the collection of indexed documents is partitioned among …

Asynchronous iterative computations with Web information retrieval structures: The PageRank case

G Kollias, E Gallopoulos, DB Szyld - arXiv preprint cs/0606047, 2006 - arxiv.org
There are several ideas being used today for Web information retrieval, and specifically in
Web search engines. The PageRank algorithm is one of those that introduce a content …

Efficiency considerations for scalable information retrieval servers

O Frieder, DA Grossman, A Chowdhury, G Frieder - 2000 - jodi-ojs-tdl.tdl.org
We review a variety of techniques to improve efficiency in information retrieval. Given the
increasing volumes of data that are available electronically, understanding and using such …