W Li, Z Ma, LJ Deng, X Fan, Y Tian - … on Circuits and Systems for Video …, 2023 - dl.acm.org
Most of the image-text retrieval methods carry out accurate results using fine-grained
features for feature alignment. However, extracting the robustness features while …