Analyzing imbalance among homogeneous index servers in a web search system

CS Badue, R Baeza-Yates, B Ribeiro-Neto… - Information processing & …, 2007 - Elsevier
The performance of parallel query processing in a cluster of index servers is crucial for
modern web search systems. In such a scenario, the response time basically depends on …

Capacity planning for vertical search engines

C Badue, J Almeida, V Almeida, R Baeza-Yates… - arXiv preprint arXiv …, 2010 - arxiv.org
Vertical search engines focus on specific slices of content, such as the Web of a single
country or the document collection of a large corporation. Despite this, like general open …

[PDF][PDF] Modeling performance-driven workload characterization of web search systems

C Badue, R Baeza-Yates, B Ribeiro-Neto… - Proceedings of the 15th …, 2006 - dl.acm.org
Previous work on workload characterization for Web search systems mainly focuses on the
characterization of user search behavior [4, 7]. In this paper, however, we model workloads …

A combined semi-pipelined query processing architecture for distributed full-text retrieval

S Jonassen, SE Bratsberg - International conference on web information …, 2010 - Springer
Term-partitioning is an efficient way to distribute a large inverted index. Two fundamentally
different query processing approaches are pipelined and non-pipelined. While the pipelined …

[PDF][PDF] Impact of the query model and system settings on performance of distributed inverted indexes

S Jonassen, SE Bratsberg - Proceedings of the 22nd Norwegian …, 2009 - researchgate.net
This paper presents an evaluation of three partitioning methods for distributed inverted
indexes: local, global and hybrid indexing, combined with two generalized query models …

Diversified caching for replicated web search engines

C Xu, B Tang, ML Yiu - 2015 IEEE 31st International …, 2015 - ieeexplore.ieee.org
Commercial web search engines adopt parallel and replicated architecture in order to
support high query throughput. In this paper, we investigate the effect of caching on the …

Efficient query processing in distributed search engines

S Jonassen - 2013 - ntnuopen.ntnu.no
Web search engines have to deal with a rapidly increasing amount of information, high
query loads and tight performance constraints. The success of a search engine depends on …

Técnicas de caching de intersecciones en motores de búsqueda

GH Tolosa - 2016 - bibliotecadigital.exactas.uba.ar
Los motores de búsqueda procesan enormes cantidades de datos (páginas web)
paraconstruir estructuras sofisticadas que soportan la búsqueda. La cantidad cada vez …

Performance Analysis of Distributed Web Information Retrieval Systems

F Cacheda, V Formoso… - IEEE Latin America …, 2007 - ieeexplore.ieee.org
The importance and size of Web search engines is increasing daily. Information retrieval
systems based on a single centralized index present several problems, which lead to the …

Information and data management at PUC-Rio and UFMG

AL Furtado, N Ziviani - Proceedings of the VLDB Endowment, 2018 - dl.acm.org
This article presents a summary of the main activities of the Database & Information Systems
Research Group at Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio) and the …