PyTerrier: Declarative experimentation in Python from BM25 to dense retrieval

C Macdonald, N Tonellotto, S MacAvaney… - Proceedings of the 30th …, 2021 - dl.acm.org
PyTerrier is a Python-based retrieval framework for expressing simple and complex
information retrieval (IR) pipelines in a declarative manner. While making use of the long …

Optimizing web search using social annotations

S Bao, G Xue, X Wu, Y Yu, B Fei, Z Su - Proceedings of the 16th …, 2007 - dl.acm.org
This paper explores the use of social annotations to improve websearch. Nowadays, many
services, eg del. icio. us, have been developed for web users to organize and share their …

Hierarchical document categorization with support vector machines

L Cai, T Hofmann - Proceedings of the thirteenth ACM international …, 2004 - dl.acm.org
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a
crucial step in knowledge and content management. Standard machine learning techniques …

Automatic identification of user goals in web search

U Lee, Z Liu, J Cho - Proceedings of the 14th international conference …, 2005 - dl.acm.org
There has been recent interests in studying the" goal" behind a user's Web query, so that
this goal can be used to improve the quality of a search engine's results. Previous studies …

Query type classification for web document retrieval

IH Kang, GC Kim - Proceedings of the 26th annual international ACM …, 2003 - dl.acm.org
The heterogeneous Web exacerbates IR problems and short user queries make them
worse. The contents of web documents are not enough to find good answer documents. Link …

Optimizing web search using web click-through data

GR Xue, HJ Zeng, Z Chen, Y Yu, WY Ma… - Proceedings of the …, 2004 - dl.acm.org
The performance of web search engines may often deteriorate due to the diversity and noisy
information contained within web pages. User click-through data can be used to introduce …

The importance of prior probabilities for entry page search

W Kraaij, T Westerveld, D Hiemstra - Proceedings of the 25th annual …, 2002 - dl.acm.org
An important class of searches on the world-wide-web has the goal to find an entry page
(homepage) of an organisation. Entry page search is quite different from Ad Hoc search …

Analysis of anchor text for web search

N Eiron, KS McCurley - Proceedings of the 26th annual international …, 2003 - dl.acm.org
It was observed as early as 1994 that anchor text in web documents is very useful in
improving the quality of web text search. In spite of the fact that many commercial search …

Verifying relevance between keywords and web site contents

B Zhang, HJ Zeng, Z Chen, WY Ma, L Li, Y Li… - US Patent …, 2007 - Google Patents
Abstract Systems and methods for verifying relevance between terms and Web site contents
are described. In one aspect, site contents from a bid URL are retrieved. Expanded term (s) …

Engineering a multi-purpose test collection for web retrieval experiments

P Bailey, N Craswell, D Hawking - Information Processing & Management, 2003 - Elsevier
Past research into text retrieval methods for the Web has been restricted by the lack of a test
collection capable of supporting experiments which are both realistic and reproducible. The …