X Chu, L Qiao, X Lin, S Xu, Y Yang, Y Hu,
F Wei… - arXiv preprint arXiv …, 2023 - arxiv.org
We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted
to run on mobile devices. It is an amalgamation of a myriad of architectural designs and …