A framework for human evaluation of large language models in healthcare derived from literature review

TYC Tam, S Sivarajkumar, S Kapoor, AV Stolyar… - NPJ Digital …, 2024 - nature.com
With generative artificial intelligence (GenAI), particularly large language models (LLMs),
continuing to make inroads in healthcare, assessing LLMs with human evaluations is …

Generative artificial intelligence in healthcare: A scoping review on benefits, challenges and applications

K Moulaei, A Yadegari, M Baharestani… - International Journal of …, 2024 - Elsevier
Background Generative artificial intelligence (GAI) is revolutionizing healthcare with
solutions for complex challenges, enhancing diagnosis, treatment, and care through new …

Large language models in biomedical and health informatics: A review with bibliometric analysis

H Yu, L Fan, L Li, J Zhou, Z Ma, L Xian, W Hua… - Journal of Healthcare …, 2024 - Springer
Large language models (LLMs) have rapidly become important tools in Biomedical and
Health Informatics (BHI), potentially enabling new ways to analyze data, treat patients, and …

The policies on the use of large language models in radiological journals are lacking: a meta-research study

J Zhong, Y Xing, Y Hu, J Lu, J Yang, G Zhang… - Insights into …, 2024 - Springer
Objective To evaluate whether and how the radiological journals present their policies on
the use of large language models (LLMs), and identify the journal characteristic variables …

A Survey for Large Language Models in Biomedicine

C Wang, M Li, J He, Z Wang, E Darzi, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent breakthroughs in large language models (LLMs) offer unprecedented natural
language understanding and generation capabilities. However, existing surveys on LLMs in …

[HTML][HTML] Artificial Intelligence in Multilingual Interpretation and Radiology Assessment for Clinical Language Evaluation (AI-MIRACLE)

P Khanna, G Dhillon, V Buddhavarapu… - Journal of Personalized …, 2024 - mdpi.com
The AI-MIRACLE Study investigates the efficacy of using ChatGPT 4.0, a large language
model (LLM), for translating and simplifying radiology reports into multiple languages, aimed …

A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

TYC Tam, S Sivarajkumar, S Kapoor, AV Stolyar… - arXiv preprint arXiv …, 2024 - arxiv.org
As generative artificial intelligence (AI), particularly Large Language Models (LLMs),
continues to permeate healthcare, it remains crucial to supplement traditional automated …

The Role of Artificial Intelligence and Big Data for Gastrointestinal Disease

NM Holt, MF Byrne - Gastrointestinal Endoscopy Clinics, 2024 - giendo.theclinics.com
Artificial intelligence (AI) and the use of “big data”(BD) are rapidly evolving concepts in all
fields and industries, with the potential to both reduce the burden of human effort, particularly …

Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review

T Shi, J Ma, Z Yu, H Xu, M Xiong, M Xiao, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid development of artificial intelligence (AI), large language models (LLMs) have
shown strong capabilities in natural language understanding, reasoning, and generation …

[PDF][PDF] Is your curriculum GenAI-proof? A method for GenAI impact assessment and a case study

R Jongkind, E Elings, E Joukes, T Broens, H Leopold… - Conference on Human … - osf.io
The introduction of the Generative AI (GenAI) application ChatGPT in November 2022
marked the starting point of capable large language models which, to a certain extent, can …