Space4hgnn: a novel, modularized and reproducible platform to evaluate heterogeneous graph neural network

T Zhao, C Yang, Y Li, Q Gan, Z Wang, F Liang… - Proceedings of the 45th …, 2022 - dl.acm.org
Heterogeneous Graph Neural Network (HGNN) has been successfully employed in various
tasks, but we cannot accurately know the importance of different design dimensions of …

Multiple instance relation graph reasoning for cross-modal hash retrieval

C Hou, Z Li, Z Tang, X Xie, H Ma - Knowledge-Based Systems, 2022 - Elsevier
The similarity calculation is too simple in most cross-modal hash retrieval methods, which do
not consider the impact of the relations between instances. To solve this problem, this paper …

Similarity Graph-correlation Reconstruction Network for unsupervised cross-modal hashing

D Yao, Z Li, B Li, C Zhang, H Ma - Expert Systems with Applications, 2024 - Elsevier
Existing cross-modal hash retrieval methods can simultaneously enhance retrieval speed
and reduce storage space. However, these methods face a major challenge in determining …

Dual-stream knowledge-preserving hashing for unsupervised video retrieval

P Li, H Xie, J Ge, L Zhang, S Min, Y Zhang - European Conference on …, 2022 - Springer
Unsupervised video hashing usually optimizes binary codes by learning to reconstruct input
videos. Such reconstruction constraint spends much effort on frame-level temporal context …

Cross-lingual cross-modal pretraining for multimodal retrieval

H Fei, T Yu, P Li - Proceedings of the 2021 Conference of the …, 2021 - aclanthology.org
Recent pretrained vision-language models have achieved impressive performance on cross-
modal retrieval tasks in English. Their success, however, heavily depends on the availability …

Multimodal neural databases

G Trappolini, A Santilli, E Rodolà, A Halevy… - Proceedings of the 46th …, 2023 - dl.acm.org
The rise in loosely-structured data available through text, images, and other modalities has
called for new ways of querying them. Multimedia Information Retrieval has filled this gap …

Gilbert: Generative vision-language pre-training for image-text retrieval

W Hong, K Ji, J Liu, J Wang, J Chen… - Proceedings of the 44th …, 2021 - dl.acm.org
Given a text/image query, image-text retrieval aims to find the relevant items in the database.
Recently, visual-linguistic pre-training (VLP) methods have demonstrated promising …

RICH: A rapid method for image-text cross-modal hash retrieval

B Li, D Yao, Z Li - Displays, 2023 - Elsevier
Deep cross-modal hash retrieval (DCMHR) methods can effectively analyze the correlation
of multimodal data while maintaining efficiency. However, to pursue better accuracy, most …

Enhancing Dynamic Image Advertising with Vision-Language Pre-training

Z Wen, X Zhao, Z Jin, Y Yang, W Jia, X Chen… - Proceedings of the 46th …, 2023 - dl.acm.org
In the multimedia era, image becomes an effective medium in search advertising. Dynamic
Image Advertising (DIA), a system that matches queries with appropriate ad images and …

Inflate and shrink: Enriching and reducing interactions for fast text-image retrieval

H Liu, T Yu, P Li - Proceedings of the 2021 Conference on …, 2021 - aclanthology.org
By exploiting the cross-modal attention, cross-BERT methods have achieved state-of-the-art
accuracy in cross-modal retrieval. Nevertheless, the heavy text-image interactions in the …