A comprehensive survey on cross-modal retrieval

K Wang, Q Yin, W Wang, S Wu, L Wang - arXiv preprint arXiv:1607.06215, 2016 - arxiv.org
In recent years, cross-modal retrieval has drawn much attention due to the rapid growth of
multimodal data. It takes one type of data as the query to retrieve relevant data of another …

Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion

Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org
With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …

Joint and individual matrix factorization hashing for large-scale cross-modal retrieval

D Wang, Q Wang, L He, X Gao, Y Tian - Pattern recognition, 2020 - Elsevier
Multimodal hashing methods have gained considerable attention in recent years due to their
effectiveness and efficiency for cross-modal similarity searches. Existing multimodal hashing …

Survey of Research on Deep Learning Image-Text Cross-Modal Retrieval

LIU Ying, GUO Yingying, F Jie… - Journal of Frontiers …, 2022 - search.ebscohost.com
With the rapid development of deep neural networks, multi-modal learning techniques have
attracted wide attention. Cross-modal retrieval is an important branch of multimodal learning. Its …

HCMSL: Hybrid cross-modal similarity learning for cross-modal retrieval

C Zhang, J Song, X Zhu, L Zhu, S Zhang - ACM Transactions on …, 2021 - dl.acm.org
The purpose of cross-modal retrieval is to find the relationship between different modal
samples and to retrieve other modal samples with similar semantics by using a certain …

Secure and robust watermark scheme based on multiple transforms and particle swarm optimization algorithm

NR Zhou, AW Luo, WP Zou - Multimedia Tools and Applications, 2019 - Springer
To improve the security, robustness and imperceptibility of watermark schemes, a novel
watermark scheme is devised by fusing multiple watermark techniques, including lifting …

Deep triplet neural networks with cluster-CCA for audio-visual cross-modal retrieval

D Zeng, Y Yu, K Oyama - ACM Transactions on Multimedia Computing …, 2020 - dl.acm.org
Cross-modal retrieval aims to retrieve data in one modality by a query in another modality,
which has been a very interesting research issue in the field of multimedia, information …

Multi-scale network with shared cross-attention for audio–visual correlation learning

J Zhang, Y Yu, S Tang, W Li, J Wu - Neural Computing and Applications, 2023 - Springer
Cross-modal audio–visual correlation learning has been an interesting research topic,
which aims to capture and understand semantic correspondences between audio and …

Image-text bidirectional learning network based cross-modal retrieval

Z Li, H Lu, H Fu, G Gu - Neurocomputing, 2022 - Elsevier
The problem of cross-modal retrieval has attracted significant attention in the cross-media
retrieval community. One key challenge of cross-modal retrieval is to eliminate the …

Adversarial learning-based semantic correlation representation for cross-modal retrieval

L Zhu, J Song, X Zhu, C Zhang, S Zhang… - IEEE …, 2020 - ieeexplore.ieee.org
Cross-modal retrieval has become a hot issue in past years. Many existing works pay
attention to correlation learning to generate a common subspace for cross-modal …