P Hu, X Peng, H Zhu, L Zhen, J Lin - 2021 IEEE/CVF Conference on …, 2021 - computer.org
Recently, cross-modal retrieval is emerging with the help of deep multimodal learning.
However, even for unimodal data, collecting large-scale well-annotated data is expensive …