As COVID-19 hounds the world, the common cause of finding a swift solution to manage the pandemic has brought together researchers, institutions, governments, and society at large …
We introduce a FEVER-like dataset COVID-Fact of $4,086 $ claims concerning the COVID- 19 pandemic. The dataset contains claims, evidence for the claims, and contradictory claims …
Recent work has shown that small distilled language models are strong competitors to models that are orders of magnitude larger and slower in a wide range of information …
Bi-encoders and cross-encoders are widely used in many state-of-the-art retrieval pipelines. In this work we study the generalization ability of these two types of architectures on a wide …
arXiv:2105.05686v2 [cs.IR] 25 Oct 2021 Page 1 arXiv:2105.05686v2 [cs.IR] 25 Oct 2021 Yes, BM25 is a Strong Baseline for Legal Case Retrieval Guilherme Moraes Rosa NeuralMind …
T Dai, J Zhao, D Li, S Tian, X Zhao, S Pan - Expert Systems with …, 2023 - Elsevier
The outbreak of COVID-19 brings almost the biggest explosions of scientific literature ever. Facing such volume literature, it is hard for researches to find desired citation when carrying …
The advent of multilingual language models has generated a resurgence of interest in cross- lingual information retrieval (CLIR), which is the task of searching documents in one …
Recent work has shown that language models scaled to billions of parameters, such as GPT- 3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment …
This paper reports on a study of cross-lingual information retrieval (CLIR) using the mT5- XXL reranker on the NeuCLIR track of TREC 2022. Perhaps the biggest contribution of this …