R Jiang, L Liu, C Chen - arXiv preprint arXiv:2403.10568, 2024 - arxiv.org
Prompt-tuning has demonstrated parameter-efficiency in fusing unimodal foundation models
for multimodal tasks. However, its limited adaptivity and expressiveness lead to suboptimal …