Human beings experience life through a spectrum of modes such as vision, taste, hearing, smell, and touch. These multiple modes are integrated for information processing in our …
P Hu, H Zhu, J Lin, D Peng, YP Zhao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this paper, we study how to make unsupervised cross-modal hashing (CMH) benefit from contrastive learning (CL) by overcoming two challenges. To be exact, i) to address the …
L Qu, M Liu, J Wu, Z Gao, L Nie - … of the 44th International ACM SIGIR …, 2021 - dl.acm.org
Image-text retrieval is a fundamental and crucial branch in information retrieval. Although much progress has been made in bridging vision and language, it remains challenging …
Z Yuan, W Zhang, C Tian, X Rong… - … on Geoscience and …, 2022 - ieeexplore.ieee.org
Cross-modal remote sensing text-image retrieval (RSCTIR) has recently become an urgent research hotspot due to its ability of enabling fast and flexible information extraction on …
Cross-modal hashing has sparked much attention in large-scale information retrieval for its storage and query efficiency. Despite the great success achieved by supervised …
P Hu, X Peng, H Zhu, L Zhen… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Recently, cross-modal retrieval is emerging with the help of deep multimodal learning. However, even for unimodal data, collecting large-scale well-annotated data is expensive …
L Zhen, P Hu, X Peng, RSM Goh… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Cross-modal retrieval (CMR) enables flexible retrieval experience across different modalities (e.g., texts versus images), which maximally benefits us from the abundance of …
In this paper, we introduce a novel audio-visual multi-modal bridging framework that can utilize both audio and visual information, even with uni-modal inputs. We exploit a memory …
Y Liu, J Wu, L Qu, T Gan, J Yin… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cross-modal retrieval aims to retrieve relevant data from another modality when given a query of one modality. Although most existing methods that rely on the label information of …