X Qian, B Liu - … on Computer Vision, Application, and Algorithm …, 2025 - spiedigitallibrary.org
In recent years, the CLIP model has achieved remarkable success in image-text retrieval
tasks through contrastive learning. However, CLIP still exhibits certain limitations when …