J Yang, C Li, P Zhang, B Xiao, C Liu, L Yuan… - arXiv preprint arXiv …, 2022 - arxiv.org
Visual recognition is recently learned via either supervised learning on human-annotated
image-label data or language-image contrastive learning with webly-crawled image-text …