J Yu, Y Ding,
J Dong,
Y Li - … of the 2024 International Conference on …, 2024 - dl.acm.org
Visual Semantic Embedding (VSE) is a prominent approach in image-text retrieval, aiming to
learn a deep embedding space that aligns visual data with semantic text labels. However …