A framework for human evaluation of large language models in healthcare derived from literature review

TYC Tam, S Sivarajkumar, S Kapoor, AV Stolyar… - NPJ Digital …, 2024 - nature.com
With generative artificial intelligence (GenAI), particularly large language models (LLMs),
continuing to make inroads in healthcare, assessing LLMs with human evaluations is …

Large language model use in clinical oncology

N Carl, F Schramm, S Haggenmüller, JN Kather… - NPJ Precision …, 2024 - nature.com
Large language models (LLMs) are undergoing intensive research for various healthcare
domains. This systematic review and meta-analysis assesses current applications …

Preventing harm from non-conscious bias in medical generative AI

J Hastings - The Lancet Digital Health, 2024 - thelancet.com
Large language models such as OpenAI's GPT-4 have the potential to transform medicine1
by enabling automation of a range of tasks, including writing discharge summaries, 2 …

[HTML][HTML] Prompt engineering paradigms for medical applications: Scoping review

J Zaghir, M Naguib, M Bjelogrlic, A Névéol… - Journal of Medical …, 2024 - jmir.org
Background Prompt engineering, focusing on crafting effective prompts to large language
models (LLMs), has garnered attention for its capabilities at harnessing the potential of …

Fine-tuning a local LLaMA-3 large language model for automated privacy-preserving physician letter generation in radiation oncology

Y Hou, C Bert, A Gomaa, G Lahmer, D Höfler… - Frontiers in Artificial …, 2025 - frontiersin.org
Introduction Generating physician letters is a time-consuming task in daily clinical practice.
Methods This study investigates local fine-tuning of large language models (LLMs) …

Evaluating ChatGPT's competency in radiation oncology: A comprehensive assessment across clinical scenarios

S Ramadan, A Mutsaers, PHC Chen, G Bauman… - Radiotherapy and …, 2025 - Elsevier
Purpose Artificial intelligence (AI) and machine learning present an opportunity to enhance
clinical decision-making in radiation oncology. This study aims to evaluate the competency …

The accuracy of artificial intelligence ChatGPT in oncology examination questions

R Chow, S Hasan, A Zheng, C Gao, G Valdes… - Journal of the American …, 2024 - Elsevier
The aim of this study is to assess the accuracy of Chat Generative Pretrained Transformer
(ChatGPT) in response to oncology examination questions in the setting of one-shot …

[HTML][HTML] AI-Enhanced Healthcare: Integrating ChatGPT-4 in ePROs for Improved Oncology Care and Decision-Making: A Pilot Evaluation

C Liao, C Chu, M Lien, Y Wu, T Wang - Current Oncology, 2024 - mdpi.com
Background: Since 2023, ChatGPT-4 has been impactful across several sectors including
healthcare, where it aids in medical information analysis and education. Electronic patient …

Artificial Intelligence, Machine Learning and Big Data in Radiation Oncology

S Zhu, SJ Ma, A Farag, T Huerta… - Hematology …, 2025 - hemonc.theclinics.com
Artificial Intelligence, Machine Learning and Big Data in Radiation Oncology - Hematology/Oncology
Clinics Skip to Main Content Skip to Main Menu Advertisement Hematology/Oncology Clinics …

A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

TYC Tam, S Sivarajkumar, S Kapoor, AV Stolyar… - arXiv preprint arXiv …, 2024 - arxiv.org
As generative artificial intelligence (AI), particularly Large Language Models (LLMs),
continues to permeate healthcare, it remains crucial to supplement traditional automated …