A survey of resource-efficient llm and multimodal foundation models

M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large foundation models, including large language models (LLMs), vision transformers
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …

Towards efficient generative large language model serving: A survey from algorithms to systems

X Miao, G Oliaro, Z Zhang, X Cheng, H Jin… - arXiv preprint arXiv …, 2023 - arxiv.org
In the rapidly evolving landscape of artificial intelligence (AI), generative large language
models (LLMs) stand at the forefront, revolutionizing how we interact with our data. However …

Mm-llms: Recent advances in multimodal large language models

D Zhang, Y Yu, C Li, J Dong, D Su, C Chu… - arXiv preprint arXiv …, 2024 - arxiv.org
In the past year, MultiModal Large Language Models (MM-LLMs) have undergone
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …

LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

M Hinck, ML Olson, D Cobbley, SY Tseng… - arXiv preprint arXiv …, 2024 - arxiv.org
We train a suite of multimodal foundation models (MMFM) using the popular LLaVA
framework with the recently released Gemma family of large language models (LLMs). Of …

[PDF][PDF] The dawn of lmms: Preliminary explorations with gpt-4v (ision)

Z Yang, L Li, K Lin, J Wang, CC Lin… - arXiv preprint arXiv …, 2023 - stableaiprompts.com
Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …

Foundation and large language models: fundamentals, challenges, opportunities, and social impacts

D Myers, R Mohawesh, VI Chellaboina, AL Sathvik… - Cluster …, 2024 - Springer
Abstract Foundation and Large Language Models (FLLMs) are models that are trained using
a massive amount of data with the intent to perform a variety of downstream tasks. FLLMs …

Tinygpt-v: Efficient multimodal large language model via small backbones

Z Yuan, Z Li, L Sun - arXiv preprint arXiv:2312.16862, 2023 - arxiv.org
In the era of advanced multimodel learning, multimodal large language models (MLLMs)
such as GPT-4V have made remarkable strides towards bridging language and visual …

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

L Zhang, X Liu, Z Li, X Pan, P Dong, R Fan… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have seen great advance in both academia and industry,
and their popularity results in numerous open-source frameworks and techniques in …

Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects

MU Hadi, R Qureshi, A Shah, M Irfan, A Zafar… - Authorea …, 2023 - techrxiv.org
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …

Llm augmented llms: Expanding capabilities through composition

R Bansal, B Samanta, S Dalmia, N Gupta… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundational models with billions of parameters which have been trained on large corpora
of data have demonstrated non-trivial skills in a variety of domains. However, due to their …