Efficiently teaching an effective dense retriever with balanced topic aware sampling

S Hofstätter, SC Lin, JH Yang, J Lin… - Proceedings of the 44th …, 2021 - dl.acm.org
A vital step towards the widespread adoption of neural retrieval models is their resource
efficiency throughout the training, indexing and query workflows. The neural IR community …

[图书][B] Pretrained transformers for text ranking: Bert and beyond

J Lin, R Nogueira, A Yates - 2022 - books.google.com
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …

Colbert: Efficient and effective passage search via contextualized late interaction over bert

O Khattab, M Zaharia - Proceedings of the 43rd International ACM SIGIR …, 2020 - dl.acm.org
Recent progress in Natural Language Understanding (NLU) is driving fast-paced advances
in Information Retrieval (IR), largely owed to fine-tuning deep language models (LMs) for …

Document ranking with a pretrained sequence-to-sequence model

R Nogueira, Z Jiang, J Lin - arXiv preprint arXiv:2003.06713, 2020 - arxiv.org
This work proposes a novel adaptation of a pretrained sequence-to-sequence model to the
task of document ranking. Our approach is fundamentally different from a commonly …

Automated fact-checking for assisting human fact-checkers

P Nakov, D Corney, M Hasanain, F Alam… - arXiv preprint arXiv …, 2021 - arxiv.org
The reporting and the analysis of current events around the globe has expanded from
professional, editor-lead journalism all the way to citizen journalism. Nowadays, politicians …

Multi-stage document ranking with BERT

R Nogueira, W Yang, K Cho, J Lin - arXiv preprint arXiv:1910.14424, 2019 - arxiv.org
The advent of deep neural networks pre-trained via language modeling tasks has spurred a
number of successful applications in natural language processing. This work explores one …

Improving efficient neural ranking models with cross-architecture knowledge distillation

S Hofstätter, S Althammer, M Schröder… - arXiv preprint arXiv …, 2020 - arxiv.org
Retrieval and ranking models are the backbone of many applications such as web search,
open domain QA, or text-based recommender systems. The latency of neural ranking …

Pretrained transformers for text ranking: BERT and beyond

A Yates, R Nogueira, J Lin - Proceedings of the 14th ACM International …, 2021 - dl.acm.org
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …

The expando-mono-duo design pattern for text ranking with pretrained sequence-to-sequence models

R Pradeep, R Nogueira, J Lin - arXiv preprint arXiv:2101.05667, 2021 - arxiv.org
We propose a design pattern for tackling text ranking problems, dubbed" Expando-Mono-
Duo", that has been empirically validated for a number of ad hoc retrieval tasks in different …

[PDF][PDF] BERT-PLI: Modeling paragraph-level interactions for legal case retrieval.

Y Shao, J Mao, Y Liu, W Ma, K Satoh, M Zhang, S Ma - IJCAI, 2020 - ijcai.org
Legal case retrieval is a specialized IR task that involves retrieving supporting cases given a
query case. Compared with traditional ad-hoc text retrieval, the legal case retrieval task is …