Heterogeneous attention network for effective and efficient cross-modal retrieval

Space4hgnn: a novel, modularized and reproducible platform to evaluate heterogeneous graph neural network

T Zhao, C Yang, Y Li, Q Gan, Z Wang, F Liang… - Proceedings of the 45th …, 2022 - dl.acm.org

Heterogeneous Graph Neural Network (HGNN) has been successfully employed in various
tasks, but we cannot accurately know the importance of different design dimensions of …

被引用次数：40 相关文章所有 5 个版本

Multiple instance relation graph reasoning for cross-modal hash retrieval

C Hou, Z Li, Z Tang, X Xie, H Ma - Knowledge-Based Systems, 2022 - Elsevier

The similarity calculation is too simple in most cross-modal hash retrieval methods, which do
not consider the impact of the relations between instances. To solve this problem, this paper …

被引用次数：21 相关文章所有 2 个版本

Similarity Graph-correlation Reconstruction Network for unsupervised cross-modal hashing

D Yao, Z Li, B Li, C Zhang, H Ma - Expert Systems with Applications, 2024 - Elsevier

Existing cross-modal hash retrieval methods can simultaneously enhance retrieval speed
and reduce storage space. However, these methods face a major challenge in determining …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Dual-stream knowledge-preserving hashing for unsupervised video retrieval

P Li, H Xie, J Ge, L Zhang, S Min, Y Zhang - European Conference on …, 2022 - Springer

Unsupervised video hashing usually optimizes binary codes by learning to reconstruct input
videos. Such reconstruction constraint spends much effort on frame-level temporal context …

被引用次数：13 相关文章所有 5 个版本

[PDF] aclanthology.org

Cross-lingual cross-modal pretraining for multimodal retrieval

H Fei, T Yu, P Li - Proceedings of the 2021 Conference of the …, 2021 - aclanthology.org

Recent pretrained vision-language models have achieved impressive performance on cross-
modal retrieval tasks in English. Their success, however, heavily depends on the availability …

被引用次数：36 相关文章

[PDF] acm.org

Multimodal neural databases

G Trappolini, A Santilli, E Rodolà, A Halevy… - Proceedings of the 46th …, 2023 - dl.acm.org

The rise in loosely-structured data available through text, images, and other modalities has
called for new ways of querying them. Multimedia Information Retrieval has filled this gap …

被引用次数：11 相关文章所有 6 个版本

[PDF] acm.org

Gilbert: Generative vision-language pre-training for image-text retrieval

W Hong, K Ji, J Liu, J Wang, J Chen… - Proceedings of the 44th …, 2021 - dl.acm.org

Given a text/image query, image-text retrieval aims to find the relevant items in the database.
Recently, visual-linguistic pre-training (VLP) methods have demonstrated promising …

被引用次数：29 相关文章

RICH: A rapid method for image-text cross-modal hash retrieval

B Li, D Yao, Z Li - Displays, 2023 - Elsevier

Deep cross-modal hash retrieval (DCMHR) methods can effectively analyze the correlation
of multimodal data while maintaining efficiency. However, to pursue better accuracy, most …

被引用次数：7 相关文章

[PDF] arxiv.org

Enhancing Dynamic Image Advertising with Vision-Language Pre-training

Z Wen, X Zhao, Z Jin, Y Yang, W Jia, X Chen… - Proceedings of the 46th …, 2023 - dl.acm.org

In the multimedia era, image becomes an effective medium in search advertising. Dynamic
Image Advertising (DIA), a system that matches queries with appropriate ad images and …

被引用次数：4 相关文章所有 4 个版本

[PDF] aclanthology.org

Inflate and shrink: Enriching and reducing interactions for fast text-image retrieval

H Liu, T Yu, P Li - Proceedings of the 2021 Conference on …, 2021 - aclanthology.org

By exploiting the cross-modal attention, cross-BERT methods have achieved state-of-the-art
accuracy in cross-modal retrieval. Nevertheless, the heavy text-image interactions in the …

被引用次数：17 相关文章所有 3 个版本

高级搜索

QQ 群