X Yuan, Z Lin, J Kuen, J Zhang, Y Wang… - 2021 IEEE/CVF …, 2021 - ieeexplore.ieee.org
We develop an approach to learning visual representations that embraces multimodal data,
driven by a combination of intra-and inter-modal similarity preservation objectives. Unlike …