Cross-language information retrieval

P Galuščáková, DW Oard, S Nair - arXiv preprint arXiv:2111.05988, 2021 - arxiv.org
Two key assumptions shape the usual view of ranked retrieval: (1) that the searcher can
choose words for their query that might appear in the documents that they wish to see, and …

Overview of the TREC 2023 NeuCLIR Track

D Lawrie, S MacAvaney, J Mayfield… - arXiv preprint arXiv …, 2024 - arxiv.org
The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR)
track is to study the impact of neural approaches to cross-language information retrieval. The …

CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval

S Sun, K Duh - Proceedings of the 2020 Conference on Empirical …, 2020 - aclanthology.org
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets
for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …

AfriCLIRMatrix: Enabling cross-lingual information retrieval for African languages

O Ogundepo, X Zhang, S Sun, K Duh… - Proceedings of the 2022 …, 2022 - aclanthology.org
Language diversity in NLP is critical in enabling the development of tools for a wide
range of users. However, there are limited resources for building such tools for many …

Cross-lingual learning-to-rank with shared representations

S Sasaki, S Sun, S Schamoni, K Duh… - Proceedings of the 2018 …, 2018 - aclanthology.org
Cross-lingual information retrieval (CLIR) is a document retrieval task where the documents
are written in a language different from that of the user's query. This is a challenging problem …

Quality estimation from scratch (quetch): Deep learning for word-level translation quality estimation

J Kreutzer, S Schamoni, S Riezler - Proceedings of the Tenth …, 2015 - aclanthology.org
This paper describes the system submitted by the University of Heidelberg to the Shared
Task on Word-level Quality Estimation at the 2015 Workshop on Statistical Machine …

Mind the gap: Cross-lingual information retrieval with hierarchical knowledge enhancement

F Zhang, Z Zhang, X Ao, D Gao, F Zhuang… - Proceedings of the …, 2022 - ojs.aaai.org
Cross-Lingual Information Retrieval (CLIR) aims to rank the documents written in a
language different from the user's query. The intrinsic gap between different languages is an …

Improving low-resource cross-lingual document retrieval by reranking with deep bilingual representations

R Zhang, C Westerfield, S Shim, G Bingham… - arXiv preprint arXiv …, 2019 - arxiv.org
In this paper, we propose to boost low-resource cross-lingual document retrieval
performance with deep bilingual query-document representations. We match queries and …

Why is a document relevant? Understanding the relevance scores in cross-lingual document retrieval

E Novak, L Bizjak, D Mladenić, M Grobelnik - Knowledge-Based Systems, 2022 - Elsevier
Modern cross-lingual document retrieval models are capable of finding documents relevant
to the query. However, they do not have the capabilities for explaining why the document is …

Mixed attention transformer for leveraging word-level knowledge to neural cross-lingual information retrieval

Z Huang, H Bonab, SM Sarwar, R Rahimi… - Proceedings of the 30th …, 2021 - dl.acm.org
Pre-trained contextualized representations offer great success for many downstream tasks,
including document ranking. The multilingual versions of such pre-trained representations …