Enhanced semantic similarity learning framework for image-text matching

K Zhang, B Hu, H Zhang, Z Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Image-text matching is a fundamental task to bridge vision and language. The critical
challenge lies in accurately learning the semantic similarity between these two …

[PDF][PDF] Enhancing Video-Text Matching via Sparse Stratified Sampling

C Lyu, W Li, T Ji, L Zhou, P Lohar, Y Yu, L Wang - researchgate.net
Video-text matching is a critical task in multimedia retrieval, but traditional methods often fail
to capture the diversity and depth of video content due to inefficient and inaccurate frame …