W Xu, Y Liu, L He, X Huang, L Jiang - arXiv preprint arXiv:2405.09215, 2024 - arxiv.org
42 天前 - … -edge multimodal vision language model. It is designed for efficient deployment on
consumer GPU servers. Our … a 1B-scale language model from the ground up, employing the …