Uc2: Universal cross-lingual cross-modal vision-and-language pre-training

M Zhou, L Zhou, S Wang, Y Cheng… - Proceedings of the …, 2021 - openaccess.thecvf.com
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

[PDF][PDF] UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu - researchgate.net
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu… - arXiv e …, 2021 - ui.adsabs.harvard.edu
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu… - arXiv preprint arXiv …, 2021 - arxiv.org
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

[PDF][PDF] UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu - air.tsinghua.edu.cn
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li… - 2021 IEEE/CVF …, 2021 - computer.org
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li… - 2021 IEEE/CVF …, 2021 - ieeexplore.ieee.org
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

[PDF][PDF] UC 2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu - openaccess.thecvf.com
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

[PDF][PDF] UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu - air.tsinghua.edu.cn
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …