V Udandarao - Master's thesis, University of Cambridge, 2022 - mlmi.eng.cam.ac.uk
Contrastive language-image pre-training has emerged as a simple yet effective way to
train large-scale vision-language models [165, 83, 181, 220] that are capable of learning …