Cross-modal retrieval: a systematic review of methods and future directions

T Wang, F Li, L Zhu, J Li, Z Zhang, HT Shen - arXiv preprint arXiv …, 2023 - arxiv.org
With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval
methods struggle to meet the needs of users seeking access to data across various …

Multiple instance relation graph reasoning for cross-modal hash retrieval

C Hou, Z Li, Z Tang, X Xie, H Ma - Knowledge-Based Systems, 2022 - Elsevier
The similarity calculation is too simple in most cross-modal hash retrieval methods, which do
not consider the impact of the relations between instances. To solve this problem, this paper …

Similarity Graph-correlation Reconstruction Network for unsupervised cross-modal hashing

D Yao, Z Li, B Li, C Zhang, H Ma - Expert Systems with Applications, 2024 - Elsevier
Existing cross-modal hash retrieval methods can simultaneously enhance retrieval speed
and reduce storage space. However, these methods face a major challenge in determining …

Continual Learning for Smart City: A Survey

L Yang, Z Luo, S Zhang, F Teng, T Li - arXiv preprint arXiv:2404.00983, 2024 - arxiv.org
With the digitization of modern cities, large data volumes and powerful computational
resources facilitate the rapid update of intelligent models deployed in smart cities. Continual …

Continual learning for cross-modal image-text retrieval based on domain-selective attention

R Yang, S Wang, Y Gu, J Wang, Y Sun, H Zhang… - Pattern Recognition, 2024 - Elsevier
Cross-modal image-text retrieval (CMITR) has been a high-value research topic for more
than a decade. In most of the previous studies, the data for all tasks are trained as a single …

Cross-Modal Alternating Learning with Task-Aware Representations for Continual Learning

W Li, BB Gao, B Xia, J Wang, J Liu, Y Liu… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Continual learning is a research field of artificial neural networks to simulate human lifelong
learning ability. Although a surge of investigations has achieved considerable performance …

RICH: A rapid method for image-text cross-modal hash retrieval

B Li, D Yao, Z Li - Displays, 2023 - Elsevier
Deep cross-modal hash retrieval (DCMHR) methods can effectively analyze the correlation
of multimodal data while maintaining efficiency. However, to pursue better accuracy, most …

Knowledge Decomposition and Replay: A Novel Cross-modal Image-Text Retrieval Continual Learning Method

R Yang, S Wang, H Zhang, S Xu, YH Guo… - Proceedings of the 31st …, 2023 - dl.acm.org
To enable machines to mimic human cognitive abilities and alleviate the catastrophic
forgetting problem in cross-modal image-text retrieval (CMITR), this paper proposes a novel …

Continual Vision-Language Retrieval via Dynamic Knowledge Rectification

Z Cui, Y Peng, X Wang, M Zhu, J Zhou - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The recent large-scale pre-trained models like CLIP have aroused great concern in vision-
language tasks. However, when required to match image-text data collected in a streaming …

Multi-domain lifelong visual question answering via self-critical distillation

M Lao, N Pu, Y Liu, Z Zhong, EM Bakker… - Proceedings of the 31st …, 2023 - dl.acm.org
Visual Question Answering (VQA) has achieved significant success over the last few years,
while most studies focus on training a VQA model on a stationary domain (eg, a given …