Comparative analysis on cross-modal information retrieval: A review

P Kaur, HS Pannu, AK Malhi - Computer Science Review, 2021 - Elsevier
Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …

Cross-modal retrieval: a systematic review of methods and future directions

F Li, L Zhu, T Wang, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org
With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users demanding access to data from various …

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

[PDF][PDF] Cross-modality person re-identification with generative adversarial training.

P Dai, R Ji, H Wang, Q Wu, Y Huang - IJCAI, 2018 - ijcai.org
Person re-identification (Re-ID) is an important task in video surveillance which
automatically searches and identifies people across different cameras. Despite the …

HSME: Hypersphere manifold embedding for visible thermal person re-identification

Y Hao, N Wang, J Li, X Gao - Proceedings of the AAAI conference on …, 2019 - ojs.aaai.org
Person Re-identification (re-ID) has great potential to contribute to video surveillance that
automatically searches and identifies people across different cameras. Heterogeneous …

Triplet-based deep hashing network for cross-modal retrieval

C Deng, Z Chen, X Liu, X Gao… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Given the benefits of its low storage requirements and high retrieval efficiency, hashing has
recently received increasing attention. In particular, cross-modal hashing has been widely …

CM-GANs: Cross-modal generative adversarial networks for common representation learning

Y Peng, J Qi - ACM Transactions on Multimedia Computing …, 2019 - dl.acm.org
It is known that the inconsistent distributions and representations of different modalities, such
as image and text, cause the heterogeneity gap, which makes it very challenging to correlate …

Multi-modal hashing for efficient multimedia retrieval: A survey

L Zhu, C Zheng, W Guan, J Li, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the explosive growth of multimedia contents, multimedia retrieval is facing
unprecedented challenges on both storage cost and retrieval speed. Hashing technique can …

Unsupervised generative adversarial cross-modal hashing

J Zhang, Y Peng, M Yuan - Proceedings of the AAAI conference on …, 2018 - ojs.aaai.org
Cross-modal hashing aims to map heterogeneous multimedia data into a common
Hamming space, which can realize fast and flexible retrieval across different modalities …

Learning dual semantic relations with graph attention for image-text matching

K Wen, X Gu, Q Cheng - … on circuits and systems for video …, 2020 - ieeexplore.ieee.org
Image-Text Matching is one major task in cross-modal information processing. The main
challenge is to learn the unified visual and textual representations. Previous methods that …