Tinygpt-v: Efficient multimodal large language model via small backbones

Z Yuan, Z Li, L Sun - arXiv preprint arXiv:2312.16862, 2023 - arxiv.org
In the era of advanced multimodel learning, multimodal large language models (MLLMs)
such as GPT-4V have made remarkable strides towards bridging language and visual …

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Z Yuan, Z Li, L Sun - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
In the era of advanced multimodel learning, multimodal large language models (MLLMs)
such as GPT-4V have made remarkable strides towards bridging language and visual …

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Z Yuan, Z Li, W Huang, Y Ye, L Sun - 2nd Workshop on Advancing Neural … - openreview.net
In recent years, multimodal large language models (MLLMs) such as GPT-4V have
demonstrated remarkable advancements, excelling in a variety of vision-language tasks …