M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu… - arXiv e …, 2024 - ui.adsabs.harvard.edu
Large foundation models, including large language models (LLMs), vision transformers
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …