S Sun, K Duh - Proceedings of the 2020 Conference on Empirical …, 2020 - aclanthology.org
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …
HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval (CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in …
In this work, we explore a Multilingual Information Retrieval (MLIR) task, where the collection includes documents in multiple languages. We demonstrate that applying state-of-the-art …
Pre-trained contextualized representations offer great success for many downstream tasks, including document ranking. The multilingual versions of such pre-trained representations …
In cross-language information retrieval using probabilistic structured queries (PSQ), translation probabilities from statistical machine translation act as a bridge between the …
T Bi, L Yao, B Yang, H Zhang, W Luo… - arXiv preprint arXiv …, 2020 - arxiv.org
Query translation (QT) is a key component in cross-lingual information retrieval system (CLIR). With the help of deep learning, neural machine translation (NMT) has shown …
The traditional evaluation of information retrieval (IR) systems is generally very costly as it requires manual relevance annotation from human experts. Recent advancements in …
TM Tashu, M Lenz, T Horváth - Applied Soft Computing, 2023 - Elsevier
In this work, we propose a novel method for generating inter-lingual document representations using neural network concept compression. The presented approach is …
L Yao, B Yang, H Zhang, B Chen… - Proceedings of the 28th …, 2020 - aclanthology.org
Query translation (QT) serves as a critical factor in successful cross-lingual information retrieval (CLIR). Due to the lack of parallel query samples, neural-based QT models are …