A Review of Large Language Models and Autonomous Agents in Chemistry

MC Ramos, CJ Collison, AD White - arXiv preprint arXiv:2407.01603, 2024 - arxiv.org
Large language models (LLMs) are emerging as a powerful tool in chemistry across multiple
domains. In chemistry, LLMs are able to accurately predict properties, design new …

[HTML][HTML] Contrastive learning penalized cross-entropy with diversity contrastive search decoding for diagnostic report generation of reduced token repetition

T Zhang, J Meng, Y Yang, S Yu - Applied Sciences, 2024 - mdpi.com
Medical imaging description and disease diagnosis are vitally important yet time-consuming.
Automated diagnosis report generation (DRG) from medical imaging description can reduce …

Token-Mixer: Bind Image and Text in One Embedding Space for Medical Image Reporting

Y Yang, J Yu, Z Fu, K Zhang, T Yu… - … on Medical Imaging, 2024 - ieeexplore.ieee.org
Medical image reporting focused on automatically generating the diagnostic reports from
medical images has garnered growing research attention. In this task, learning cross-modal …

MAIRA-2: Grounded Radiology Report Generation

S Bannur, K Bouzid, DC Castro, A Schwaighofer… - arXiv preprint arXiv …, 2024 - arxiv.org
Radiology reporting is a complex task that requires detailed image understanding,
integration of multiple inputs, including comparison with prior imaging, and precise …

Merlin: A Vision Language Foundation Model for 3D Computed Tomography

L Blankemeier, JP Cohen, A Kumar… - arXiv preprint arXiv …, 2024 - arxiv.org
Over 85 million computed tomography (CT) scans are performed annually in the US, of
which approximately one quarter focus on the abdomen. Given the current radiologist …

Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

X Huang, X Wang, H Zhang, J Xi, J An, H Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Security concerns related to Large Language Models (LLMs) have been extensively
explored, yet the safety implications for Multimodal Large Language Models (MLLMs) …

GREEN: Generative Radiology Report Evaluation and Error Notation

S Ostmeier, J Xu, Z Chen, M Varma… - arXiv preprint arXiv …, 2024 - arxiv.org
Evaluating radiology reports is a challenging problem as factual correctness is extremely
important due to the need for accurate medical communication about medical images …

CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients

P Chambon, JB Delbrouck, T Sounack… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the release of the original CheXpert paper five years ago, CheXpert has become one
of the most widely used and cited clinical AI datasets. The emergence of vision language …

Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning

Z Gu, F Liu, C Yin, P Zhang - arXiv preprint arXiv:2405.11640, 2024 - arxiv.org
The adoption of large language models (LLMs) in healthcare has attracted significant
research interest. However, their performance in healthcare remains under-investigated and …

From Generalist to Specialist: Incorporating Domain-Knowledge into Flamingo for Chest X-Ray Report Generation

R Stock, S Denner, Y Kirchhoff, C Ulrich… - Medical Imaging with …, 2024 - openreview.net
Automating the generation of accurate and reliable radiological reports from chest X-ray
images represents a significant challenge in medical image computing. In this context …