Semantic models for the first-stage retrieval: A comprehensive review

J Guo, Y Cai, Y Fan, F Sun, R Zhang… - ACM Transactions on …, 2022 - dl.acm.org
Multi-stage ranking pipelines have been a practical solution in modern search systems,
where the first-stage retrieval is to return a subset of candidate documents and latter stages …

How to approach ambiguous queries in conversational search: A survey of techniques, approaches, tools, and challenges

K Keyvan, JX Huang - ACM Computing Surveys, 2022 - dl.acm.org
The advent of recent Natural Language Processing technology has led human and machine
interactions more toward conversation. In Conversational Search Systems (CSS) like …

Asking clarifying questions in open-domain information-seeking conversations

M Aliannejadi, H Zamani, F Crestani… - Proceedings of the 42nd …, 2019 - dl.acm.org
Users often fail to formulate their complex information needs in a single query. As a
consequence, they may need to scan multiple result pages or reformulate their queries …

Building and evaluating open-domain dialogue corpora with clarifying questions

M Aliannejadi, J Kiseleva, A Chuklin, J Dalton… - arXiv preprint arXiv …, 2021 - arxiv.org
Enabling open-domain dialogue systems to ask clarifying questions when appropriate is an
important direction for improving the quality of the system response. Namely, for cases when …

Simplified data wrangling with ir_datasets

S MacAvaney, A Yates, S Feldman, D Downey… - Proceedings of the 44th …, 2021 - dl.acm.org
Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset
documentation is scattered across the Internet and once one obtains a copy of the data …

The information retrieval experiment platform

M Fröbe, JH Reimer, S MacAvaney, N Deckers… - Proceedings of the 46th …, 2023 - dl.acm.org
We integrate irdatasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval
Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and …

Efficient and effective spam filtering and re-ranking for large web datasets

GV Cormack, MD Smucker, CLA Clarke - Information retrieval, 2011 - Springer
The TREC 2009 web ad hoc and relevance feedback tasks used a new document collection,
the ClueWeb09 dataset, which was crawled from the general web in early 2009. This …

Exploiting simulated user feedback for conversational search: Ranking, rewriting, and beyond

P Owoicho, I Sekulic, M Aliannejadi, J Dalton… - Proceedings of the 46th …, 2023 - dl.acm.org
This research aims to explore various methods for assessing user feedback in mixed-
initiative conversational search (CS) systems. While CS systems enjoy profuse …

Increasing cheat robustness of crowdsourcing tasks

C Eickhoff, AP de Vries - Information retrieval, 2013 - Springer
Crowdsourcing successfully strives to become a widely used means of collecting large-scale
scientific corpora. Many research fields, including Information Retrieval, rely on this novel …

Search result diversification

RLT Santos, C Macdonald, I Ounis - Foundations and Trends® …, 2015 - nowpublishers.com
Ranking in information retrieval has been traditionally approached as a pursuit of relevant
information, under the assumption that the users' information needs are unambiguously …