Multi-graph heterogeneous interaction fusion for social recommendation

C Zhang, Y Wang, L Zhu, J Song, H Yin - ACM Transactions on …, 2021 - dl.acm.org
With the rapid development of online social recommendation system, substantial methods
have been proposed. Unlike traditional recommendation system, social recommendation …

Improving text-audio retrieval by text-aware attention pooling and prior matrix revised loss

Y Xin, D Yang, Y Zou - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
In text-audio retrieval (TAR) tasks, due to the heterogeneity of contents between text and
audio, the semantic information contained in the text is only similar to certain frames within …

Semantic pre-alignment and ranking learning with unified framework for cross-modal retrieval

Q Cheng, Z Tan, K Wen, C Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cross-modal retrieval aims at retrieving highly semantic relevant information among multi-
modalities. Existing cross-modal retrieval methods mainly explore the semantic consistency …

Computation bits maximization in UAV-enabled mobile-edge computing system

L Lyu, F Zeng, Z Xiao, C Zhang, H Jiang… - IEEE Internet of …, 2021 - ieeexplore.ieee.org
In recent years, unmanned aerial vehicles (UAVs) have been widely used in various
industries (e.g., search and rescue, express delivery, etc.) due to their high flexibility. In …

Boosting scene graph generation with visual relation saliency

Y Zhang, Y Pan, T Yao, R Huang, T Mei… - ACM Transactions on …, 2023 - dl.acm.org
The scene graph is a symbolic data structure that comprehensively describes the objects
and visual relations in a visual scene, while ignoring the inherent perceptual saliency of …

A Survey of Full-Cycle Cross-Modal Retrieval: From a Representation Learning Perspective

S Wang, L Zhu, L Shi, H Mo, S Tan - Applied Sciences, 2023 - mdpi.com
Cross-modal retrieval aims to elucidate information fusion, imitate human learning, and
advance the field. Although previous reviews have primarily focused on binary and real …

Multi-view inter-modality representation with progressive fusion for image-text matching

J Wu, L Wang, C Chen, J Lu, C Wu - Neurocomputing, 2023 - Elsevier
Recently, image-text matching has been intensively explored to bridge vision and language.
Previous methods explore an inter-modality relationship between an image-text pair from …

Cross-Modal Retrieval: A Review of Methodologies, Datasets, and Future Perspectives

Z Han, A Azman, MR Mustaffa, FB Khalid - IEEE Access, 2024 - ieeexplore.ieee.org
With the rapid development of science and technology, all types of mixed media contain
large amounts of data. Traditional single multimedia data can no longer satisfy daily …

Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval

Y Huo, Q Qibing, J Dai, W Zhang, L Huang… - ACM Transactions on …, 2024 - dl.acm.org
Cross-modal retrieval methods based on hashing have gained significant attention in both
academic and industrial research. Deep learning techniques have played a crucial role in …

Cross-Modal Semantically Augmented Network for Image-Text Matching

T Yao, Y Li, Y Li, Y Zhu, G Wang, J Yue - ACM Transactions on …, 2023 - dl.acm.org
Image-text matching plays an important role in solving the problem of cross-modal
information processing. Since there are nonnegligible semantic differences between …