H Sun,
Y Song, J Hu,
X Yu, J Liu, YW Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in large-scale models have showcased remarkable generalization
capabilities in various tasks. However, integrating multimodal processing into these models …