Hcmsl: Hybrid cross-modal similarity learning for cross-modal retrieval

C Zhang, Y Wang, L Zhu, J Song, H Yin - ACM Transactions on …, 2021 - dl.acm.org

With the rapid development of online social recommendation system, substantial methods
have been proposed. Unlike traditional recommendation system, social recommendation …

被引用次数：48 相关文章所有 3 个版本

[PDF] arxiv.org

Improving text-audio retrieval by text-aware attention pooling and prior matrix revised loss

Y Xin, D Yang, Y Zou - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

In text-audio retrieval (TAR) tasks, due to the heterogeneity of contents between text and
audio, the semantic information contained in the text is only similar to certain frames within …

被引用次数：26 相关文章所有 5 个版本

Semantic pre-alignment and ranking learning with unified framework for cross-modal retrieval

Q Cheng, Z Tan, K Wen, C Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Cross-modal retrieval aims at retrieving highly semantic relevant information among multi-
modalities. Existing cross-modal retrieval methods mainly explore the semantic consistency …

被引用次数：20 相关文章

Computation bits maximization in UAV-enabled mobile-edge computing system

L Lyu, F Zeng, Z Xiao, C Zhang, H Jiang… - IEEE Internet of …, 2021 - ieeexplore.ieee.org

In recent years, unmanned aerial vehicles (UAVs) have been widely used in various
industries (eg, search and rescue, express delivery, etc.) due to their high flexibility. In …

被引用次数：28 相关文章

[PDF] academia.edu

Boosting scene graph generation with visual relation saliency

Y Zhang, Y Pan, T Yao, R Huang, T Mei… - ACM Transactions on …, 2023 - dl.acm.org

The scene graph is a symbolic data structure that comprehensively describes the objects
and visual relations in a visual scene, while ignoring the inherent perceptual saliency of …

被引用次数：18 相关文章所有 3 个版本

[PDF] mdpi.com

A Survey of Full-Cycle Cross-Modal Retrieval: From a Representation Learning Perspective

S Wang, L Zhu, L Shi, H Mo, S Tan - Applied Sciences, 2023 - mdpi.com

Cross-modal retrieval aims to elucidate information fusion, imitate human learning, and
advance the field. Although previous reviews have primarily focused on binary and real …

被引用次数：3 相关文章所有 2 个版本

Multi-view inter-modality representation with progressive fusion for image-text matching

J Wu, L Wang, C Chen, J Lu, C Wu - Neurocomputing, 2023 - Elsevier

Recently, image-text matching has been intensively explored to bridge vision and language.
Previous methods explore an inter-modality relationship between an image-text pair from …

被引用次数：6 相关文章所有 2 个版本

[PDF] ieee.org

Cross-Modal Retrieval: A Review of Methodologies, Datasets, and Future Perspectives

Z Han, A Azman, MR Mustaffa, FB Khalid - IEEE Access, 2024 - ieeexplore.ieee.org

With the rapid development of science and technology, all types of mixed media contain
large amounts of data. Traditional single multimedia data can no longer satisfy daily …

Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval

Y Huo, Q Qibing, J Dai, W Zhang, L Huang… - ACM Transactions on …, 2024 - dl.acm.org

Cross-modal retrieval methods based on hashing have gained significant attention in both
academic and industrial research. Deep learning techniques have played a crucial role in …

被引用次数：3 相关文章

Cross-Modal Semantically Augmented Network for Image-Text Matching

T Yao, Y Li, Y Li, Y Zhu, G Wang, J Yue - ACM Transactions on …, 2023 - dl.acm.org

Image-text matching plays an important role in solving the problem of cross-modal
information processing. Since there are nonnegligible semantic differences between …

被引用次数：2 相关文章

高级搜索

QQ 群