X Yu, L Zhang,
Z Wu,
D Zhu - IEEE Transactions on Medical …, 2024 - ieeexplore.ieee.org
Multi-modality learning, exemplified by the language-image pair pre-trained CLIP model,
has demonstrated remarkable performance in enhancing zero-shot capabilities and has …