A foundation model utilizing chest ct volumes and radiology reports for supervised-level zero-shot detection of abnormalities

IE Hamamci, S Er, F Almas, AG Simsek, SN Esirgun… - CoRR, 2024 - openreview.net
While computer vision has achieved tremendous success with multimodal encoding and
direct textual interaction with images via chat-based large language models, similar …

Comprehensive Evaluation of Multimodal AI Models in Medical Imaging Diagnosis: From Data Augmentation to Preference-Based Comparison

C Ruan, C Huang, Y Yang - arXiv preprint arXiv:2412.05536, 2024 - arxiv.org
This study introduces an evaluation framework for multimodal models in medical imaging
diagnostics. We developed a pipeline incorporating data preprocessing, model inference …

[HTML][HTML] Opportunities and challenges in the application of large artificial intelligence models in radiology

L Pan, Z Zhao, Y Lu, K Tang, L Fu, Q Liang, S Peng - Meta-Radiology, 2024 - Elsevier
Influenced by ChatGPT, artificial intelligence (AI) large models have witnessed a global
upsurge in large model research and development. As people enjoy the convenience by this …

Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends

P Wang, W Lu, C Lu, R Zhou, M Li… - Big Data Mining and …, 2025 - ieeexplore.ieee.org
The advent of Large Language Models (LLMs) has sparked considerable interest in the
medical image domain, as they can generalize to multiple tasks and offer outstanding …

Impact of Multimodal Prompt Elements on Diagnostic Performance of GPT-4V in Challenging Brain MRI Cases

S Schramm, S Preis, MC Metz, K Jung, B Schmitz-Koep… - Radiology, 2025 - pubs.rsna.org
Background Studies have explored the application of multimodal large language models
(LLMs) in radiologic differential diagnosis. Yet, how different multimodal input combinations …

Toward Foundation Models in Radiology? Quantitative Assessment of GPT-4V's Multimodal and Multianatomic Region Capabilities

QD Strotzer, F Nieberle, LS Kupke, G Napodano… - Radiology, 2024 - pubs.rsna.org
Background Large language models have already demonstrated potential in medical text
processing. GPT-4V, a large vision-language model from OpenAI, has shown potential for …

SLaVA-CXR: Small Language and Vision Assistant for Chest X-ray Report Automation

J Wu, Y Kim, D Shi, D Cliffton, F Liu, H Wu - arXiv preprint arXiv …, 2024 - arxiv.org
Inspired by the success of large language models (LLMs), there is growing research interest
in developing LLMs in the medical domain to assist clinicians. However, for hospitals, using …

Enhancing human-computer interaction in chest x-ray analysis using vision and language model with eye gaze patterns

Y Kim, J Wu, Y Abdulle, Y Gao, H Wu - International Conference on …, 2024 - Springer
Abstract Recent advancements in Computer Assisted Diagnosis have shown promising
performance in medical imaging tasks, particularly in chest X-ray analysis. However, the …

The Impact of Aligning Artificial Intelligence Large Language Models With Bloom's Taxonomy in Healthcare Education

M Pears, ST Konstantinidis - Disruptive Technologies in Education …, 2024 - igi-global.com
The innovation of large language models (LLMs) has widened possibilities for renovating
healthcare education through AI-powered learning resources, such as chatbots. This …

Buffalo: Biomedical Vision-Language Understanding with Cross-Modal Prototype and Federated Foundation Model Collaboration

B Yan, Q Chen, Y Chen, X Jiang, W Huang… - Proceedings of the 33rd …, 2024 - dl.acm.org
Federated learning (FL) enables collaborative learning across multiple biomedical data silos
with multimodal foundation models while preserving privacy. Due to the heterogeneity in …