An overview of cross-media retrieval: Concepts, methodologies, benchmarks, and challenges

P Kaur, HS Pannu, AK Malhi - Computer Science Review, 2021 - Elsevier

Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …

被引用次数：96 相关文章所有 3 个版本

[PDF] arxiv.org

Cross-modal retrieval: a systematic review of methods and future directions

F Li, L Zhu, T Wang, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org

With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users demanding access to data from various …

被引用次数：15 相关文章所有 3 个版本

[PDF] thecvf.com

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

被引用次数：446 相关文章所有 6 个版本

[PDF] ijcai.org

[PDF][PDF] Cross-modality person re-identification with generative adversarial training.

P Dai, R Ji, H Wang, Q Wu, Y Huang - IJCAI, 2018 - ijcai.org

Person re-identification (Re-ID) is an important task in video surveillance which
automatically searches and identifies people across different cameras. Despite the …

被引用次数：450 相关文章所有 5 个版本

[PDF] aaai.org

HSME: Hypersphere manifold embedding for visible thermal person re-identification

Y Hao, N Wang, J Li, X Gao - Proceedings of the AAAI conference on …, 2019 - ojs.aaai.org

Person Re-identification (re-ID) has great potential to contribute to video surveillance that
automatically searches and identifies people across different cameras. Heterogeneous …

被引用次数：298 相关文章所有 5 个版本

[PDF] arxiv.org

Triplet-based deep hashing network for cross-modal retrieval

C Deng, Z Chen, X Liu, X Gao… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

Given the benefits of its low storage requirements and high retrieval efficiency, hashing has
recently received increasing attention. In particular, cross-modal hashing has been widely …

被引用次数：383 相关文章所有 9 个版本

[PDF] arxiv.org

CM-GANs: Cross-modal generative adversarial networks for common representation learning

Y Peng, J Qi - ACM Transactions on Multimedia Computing …, 2019 - dl.acm.org

It is known that the inconsistent distributions and representations of different modalities, such
as image and text, cause the heterogeneity gap, which makes it very challenging to correlate …

被引用次数：308 相关文章所有 4 个版本

Multi-modal hashing for efficient multimedia retrieval: A survey

L Zhu, C Zheng, W Guan, J Li, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the explosive growth of multimedia contents, multimedia retrieval is facing
unprecedented challenges on both storage cost and retrieval speed. Hashing technique can …

被引用次数：49 相关文章所有 4 个版本

[PDF] aaai.org

Unsupervised generative adversarial cross-modal hashing

J Zhang, Y Peng, M Yuan - Proceedings of the AAAI conference on …, 2018 - ojs.aaai.org

Cross-modal hashing aims to map heterogeneous multimedia data into a common
Hamming space, which can realize fast and flexible retrieval across different modalities …

被引用次数：228 相关文章所有 6 个版本

[PDF] arxiv.org

Learning dual semantic relations with graph attention for image-text matching

K Wen, X Gu, Q Cheng - … on circuits and systems for video …, 2020 - ieeexplore.ieee.org

Image-Text Matching is one major task in cross-modal information processing. The main
challenge is to learn the unified visual and textual representations. Previous methods that …

被引用次数：100 相关文章所有 3 个版本

高级搜索

QQ 群