Capability of GPT-4V(ision) in Japanese national medical licensing examination

T Nakao, S Miki, Y Nakamura, T Kikuchi, Y Nomura… - medRxiv, 2023 - medrxiv.org
Background: Previous research applying large language models (LLMs) to medicine was
focused on text-based information. Recently, multimodal variants of LLMs acquired the …

Capability of GPT-4V(ision) in the Japanese national medical licensing examination: evaluation study

T Nakao, S Miki, Y Nakamura, T Kikuchi… - JMIR Medical …, 2024 - mededu.jmir.org
Background: Previous research applying large language models (LLMs) to medicine was
focused on text-based information. Recently, multimodal variants of LLMs acquired the …

The performance of the multimodal large language model GPT-4 on the European board of radiology examination sample test

MS Beşler - Japanese Journal of Radiology, 2024 - Springer
I read with great interest the study conducted by Nakaura et al., which investigates the
potential of Generative Pretrained Transformers (GPT) in generating radiology reports [1]. I …

Diagnostic accuracy of GPT multimodal analysis on USMLE questions including text and visuals

V Sorin, BS Glicksberg, Y Barash, E Konen, G Nadkarni… - medRxiv, 2023 - medrxiv.org
Objective: Large Language Models (LLMs) have demonstrated proficiency in free-text
analysis in healthcare. With recent advancements, GPT-4 now has the capability to …

Performance of a large language model on Japanese emergency medicine board certification examinations

Y Igarashi, K Nakahara, T Norii, N Miyake… - Journal of Nippon …, 2024 - jstage.jst.go.jp
Background: Emergency physicians need a broad range of knowledge and skills to address
critical medical, traumatic, and environmental conditions. Artificial intelligence (AI), including …

The accuracy of large language models in RANZCR's clinical radiology exam sample questions

MS Beşler - Japanese Journal of Radiology, 2024 - Springer
Dear editor, I read with great excitement the study conducted by Nakaura et al., examining
the potential of large language models (LLMs) to serve as copilots or autonomous agents in …

Performance of multimodal GPT-4V on USMLE with Image: potential for imaging diagnostic support with explanations

Z Yang, Z Yao, M Tasmin, P Vashisht, WS Jang… - medRxiv, 2023 - medrxiv.org
Background: Using artificial intelligence (AI) to help clinical diagnoses has been an active
research topic for more than six decades. Past research, however, has not had the scale and …

Beyond the Hype: Assessing the Performance, Trustworthiness, and Clinical Suitability of GPT3.5

S Talebi, E Tong, MRK Mofrad - arXiv preprint arXiv:2306.15887, 2023 - arxiv.org
The use of large language models (LLMs) in healthcare is gaining popularity, but their
practicality and safety in clinical settings have not been thoroughly assessed. In high-stakes …

Evaluating the performance of ChatGPT-4 on the United Kingdom medical licensing assessment

UH Lai, KS Wu, TY Hsu, JKC Kan - Frontiers in Medicine, 2023 - frontiersin.org
Introduction: Recent developments in artificial intelligence large language models (LLMs),
such as ChatGPT, have allowed for the understanding and generation of human-like text …

GPT-4 Turbo with Vision fails to outperform text-only GPT-4 Turbo in the Japan Diagnostic Radiology Board Examination

Y Hirano, S Hanaoka, T Nakao, S Miki… - Japanese Journal of …, 2024 - Springer
Purpose: To assess the performance of GPT-4 Turbo with Vision (GPT-4TV), OpenAI's latest
multimodal large language model, by comparing its ability to process both text and image …