Holistic evaluation of gpt-4v for biomedical imaging

Z Liu, H Jiang, T Zhong, Z Wu, C Ma, Y Li, X Yu… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and
limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial …

Can gpt-4v (ision) serve medical applications? case studies on gpt-4v for multimodal medical diagnosis

C Wu, J Lei, Q Zheng, W Zhao, W Lin, X Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Driven by the large foundation models, the development of artificial intelligence has
witnessed tremendous progress lately, leading to a surge of general interest from the public …

Advancing medical imaging with language models: A journey from n-grams to chatgpt

M Hu, S Pan, Y Li, X Yang - arXiv preprint arXiv:2304.04920, 2023 - arxiv.org
In this paper, we aimed to provide a review and tutorial for researchers in the field of medical
imaging using language models to improve their tasks at hand. We began by providing an …

Language models are free boosters for biomedical imaging tasks

Z Lai, J Wu, S Chen, Y Zhou, A Hovakimyan… - arXiv preprint arXiv …, 2024 - arxiv.org
In this study, we uncover the unexpected efficacy of residual-based large language models
(LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of …

When vision meets reality: Exploring the clinical applicability of GPT-4 with vision

J Deng, K Heybati, M Shammas-Toma - Clinical Imaging, 2024 - Elsevier
In November 2023, OpenAI introduced the latest iteration of ChatGPT, which integrated a
novel architecture called Generative Pre-trained Transformer (GPT) 4 with vision capabilities …

Multimodal chatgpt for medical applications: an experimental study of gpt-4v

Z Yan, K Zhang, R Zhou, L He, X Li, L Sun - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we critically evaluate the capabilities of the state-of-the-art multimodal large
language model, ie, GPT-4 with Vision (GPT-4V), on Visual Question Answering (VQA) task …

Assessing GPT-4 multimodal performance in radiological image analysis

D Brin, V Sorin, Y Barash, E Konen, G Nadkarni… - medRxiv, 2023 - medrxiv.org
Objectives This study aims to assess the performance of OpenAI's multimodal GPT-4, which
can analyze both images and textual data (GPT-4V), in interpreting radiological images. It …

[HTML][HTML] Advancing medical imaging with language models: featuring a spotlight on ChatGPT

M Hu, J Qian, S Pan, Y Li, RLJ Qiu… - Physics in Medicine & …, 2024 - iopscience.iop.org
This review paper aims to serve as a comprehensive guide and instructional resource for
researchers seeking to effectively implement language models in medical imaging research …

[HTML][HTML] From text to image: challenges in integrating vision into ChatGPT for medical image interpretation

S Koga, W Du - Neural Regeneration Research, 2025 - journals.lww.com
Large language models (LLMs), such as ChatGPT developed by OpenAI, represent a
significant advancement in artificial intelligence (AI), designed to understand, generate, and …

Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks

Z Lai, J Wu, S Chen, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this study we uncover the unexpected efficacy of residual-based large language models
(LLMs) as part of encoders for biomedical imaging tasks a domain traditionally devoid of …