MPT: Multimodal Prompt Tuning for Zero-shot Instruction Learning

T Wang, Y Liu, JC Liang, Y Cui, Y Mao, S Nie… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) demonstrate remarkable performance across
a wide range of domains, with increasing emphasis on enhancing their zero-shot …