Cross-modal retrieval with partially mismatched pairs

P Hu, Z Huang, D Peng, X Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this paper, we study a challenging but less-touched problem in cross-modal retrieval, ie,
partially mismatched pairs (PMPs). Specifically, in real-world scenarios, a huge number of …

Cross-modal image-text retrieval with multitask learning

J Luo, Y Shen, X Ao, Z Zhao, M Yang - Proceedings of the 28th ACM …, 2019 - dl.acm.org
In this paper, we propose a multi-task learning approach for cross-modal image-text
retrieval. First, a correlation network is proposed for relation recognition task, which helps …

Cross-lingual cross-modal retrieval with noise-robust learning

Y Wang, J Dong, T Liang, M Zhang, R Cai… - Proceedings of the 30th …, 2022 - dl.acm.org
Despite the recent developments in the field of cross-modal retrieval, there has been less
research focusing on low-resource languages due to the lack of manually annotated …

Exposing and mitigating spurious correlations for cross-modal retrieval

JM Kim, A Koepke, C Schmid… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Cross-modal retrieval methods are the preferred tool to search databases for the text that
best matches a query image and vice versa However, image-text retrieval models commonly …

Preserving semantic neighborhoods for robust cross-modal retrieval

C Thomas, A Kovashka - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
The abundance of multimodal data (eg social media posts) has inspired interest in cross-
modal retrieval methods. Popular approaches rely on a variety of metric learning losses …

SMAN: Stacked multimodal attention network for cross-modal image–text retrieval

Z Ji, H Wang, J Han, Y Pang - IEEE transactions on cybernetics, 2020 - ieeexplore.ieee.org
This article focuses on tackling the task of the cross-modal image–text retrieval which has
been an interdisciplinary topic in both computer vision and natural language processing …

End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss

M Dorfer, J Schlüter, A Vall, F Korzeniowski… - International Journal of …, 2018 - Springer
Cross-modality retrieval encompasses retrieval tasks where the fetched items are of a
different type than the search query, eg, retrieving pictures relevant to a given text query. The …

Cross-modal image-text retrieval with semantic consistency

H Chen, G Ding, Z Lin, S Zhao, J Han - Proceedings of the 27th ACM …, 2019 - dl.acm.org
Cross-modal image-text retrieval has been a long-standing challenge in the multimedia
community. Existing methods explore various complicated embedding spaces to assess the …

Modal-adversarial semantic learning network for extendable cross-modal retrieval

X Xu, J Song, H Lu, Y Yang, F Shen… - Proceedings of the 2018 …, 2018 - dl.acm.org
Cross-modal retrieval, eg, using an image query to search related text and vice-versa, has
become a highlighted research topic, to provide flexible retrieval experience across multi …

Deep evidential learning with noisy correspondence for cross-modal retrieval

Y Qin, D Peng, X Peng, X Wang, P Hu - Proceedings of the 30th ACM …, 2022 - dl.acm.org
Cross-modal retrieval has been a compelling topic in the multimodal community. Recently,
to mitigate the high cost of data collection, the co-occurred pairs (eg, image and text) could …