Vision-language pre-training: Basics, recent advances, and future trends

Z Gan, L Li, C Li, L Wang, Z Liu… - Foundations and Trends …, 2022 - nowpublishers.com
This monograph surveys vision-language pre-training (VLP) methods for multimodal
intelligence that have been developed in the last few years. We group these approaches …

Vision-Language Pre-training: Basics, Recent Advances, and Future Trends

Z Gan, L Li, C Li, L Wang, Z Liu, J Gao - arXiv preprint arXiv:2210.09263, 2022 - arxiv.org
This paper surveys vision-language pre-training (VLP) methods for multimodal intelligence
that have been developed in the last few years. We group these approaches into three …

[CITATION][C] Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends

Z Gan, L Li, C Li, L Wang, Z Liu, J Gao - Foundations and Trends® in …, 2022 - cir.nii.ac.jp
Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends | CiNii Research
CiNii, National Institute of Informatics Academic Information Navigator [CiNii]. Skip to details. Skip to search form. Articles and data …

Vision-Language Pre-training: Basics, Recent Advances, and Future Trends

Z Gan, L Li, C Li, L Wang, Z Liu, J Gao - arXiv e-prints, 2022 - ui.adsabs.harvard.edu
This paper surveys vision-language pre-training (VLP) methods for multimodal intelligence
that have been developed in the last few years. We group these approaches into three …

Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends

Z Gan, L Li, C Li, L Wang, Z Liu, J Gao - 2022 - ieeexplore.ieee.org
Humans perceive the world through many channels, such as images viewed by the eyes, or
voices heard by the ears. Though any individual channel might be incomplete or noisy …

Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends

Z Gan, L Li, C Li, L Wang, Z Liu, J Gao - Foundations and Trends® in …, 2022 - dl.acm.org
This monograph surveys vision-language pre-training (VLP) methods for multimodal
intelligence that have been developed in the last few years. We group these approaches …
