Vector Space Proximity Based Document Retrieval For Document Embeddings Built By Transformers

P Khloponin - 2022 - spectrum.library.concordia.ca
Internet publications are staying atop of local and international events, generating hundreds,
sometimes thousands of news articles per day, making it difficult for readers to navigate this …

Evaluating Dense Passage Retrieval using Transformers

N Sadri - arXiv preprint arXiv:2208.06959, 2022 - arxiv.org
Although representational retrieval models based on Transformers have been able to make
major advances in the past few years, and despite the widely accepted conventions and …

Using document embeddings for background linking of news articles

P Khloponin, L Kosseim - … on Applications of Natural Language to …, 2021 - Springer
This paper describes our experiments in using document embeddings to provide
background links to news articles. This work was done as part of the recent TREC 2020 …

Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

L Merrick, D Xu, G Nuti, D Campos - arXiv preprint arXiv:2405.05374, 2024 - arxiv.org
This report describes the training dataset creation and recipe behind the family of\texttt
{arctic-embed} text embedding models (a set of five models ranging from 22 to 334 million …

Cross-domain sentence modeling for relevance transfer with BERT

Z Akkalyoncu Yilmaz - 2019 - uwspace.uwaterloo.ca
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich
semantic information embedded in the document texts. One promising recent trend in …

Noise-reduction for automatically transferred relevance judgments

M Fröbe, C Akiki, M Potthast, M Hagen - International Conference of the …, 2022 - Springer
Abstract The TREC Deep Learning tracks used MS MARCO Version 1 as their official
training data until 2020 and switched to Version 2 in 2021. For Version 2, all previously …

CODER: An efficient framework for improving retrieval through COntextual Document Embedding Reranking

G Zerveas, N Rekabsaz, D Cohen… - arXiv preprint arXiv …, 2021 - arxiv.org
Contrastive learning has been the dominant approach to training dense retrieval models. In
this work, we investigate the impact of ranking context-an often overlooked aspect of …

PACRR: A position-aware neural IR model for relevance matching

K Hui, A Yates, K Berberich, G De Melo - arXiv preprint arXiv:1704.03940, 2017 - arxiv.org
In order to adopt deep learning for information retrieval, models are needed that can capture
all relevant information required to assess the relevance of a document to a given user …

Comparing score aggregation approaches for document retrieval with pretrained transformers

X Zhang, A Yates, J Lin - … Retrieval: 43rd European Conference on IR …, 2021 - Springer
While BERT has been shown to be effective for passage retrieval, its maximum input length
limitation poses a challenge when applying the model to document retrieval. In this work, we …

Overview of the TREC 2023 Product Product Search Track

D Campos, S Kallumadi, C Rosset, CX Zhai… - arXiv preprint arXiv …, 2023 - arxiv.org
This is the first year of the TREC Product search track. The focus this year was the creation of
a reusable collection and evaluation of the impact of the use of metadata and multi-modal …