On the challenges and perspectives of foundation models for medical image analysis

S Zhang, D Metaxas - Medical Image Analysis, 2023 - Elsevier
This article discusses the opportunities, applications and future directions of large-scale
pretrained models, ie, foundation models, which promise to significantly improve the …

U-mamba: Enhancing long-range dependency for biomedical image segmentation

J Ma, F Li, B Wang - arXiv preprint arXiv:2401.04722, 2024 - arxiv.org
Convolutional Neural Networks (CNNs) and Transformers have been the most popular
architectures for biomedical image segmentation, but both of them have limited ability to …

Swin-umamba: Mamba-based unet with imagenet-based pretraining

J Liu, H Yang, HY Zhou, Y Xi, L Yu, Y Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
Accurate medical image segmentation demands the integration of multi-scale information,
spanning from local features to global dependencies. However, it is challenging for existing …

Abdomenatlas-8k: Annotating 8,000 CT volumes for multi-organ segmentation in three weeks

C Qu, T Zhang, H Qiao, Y Tang… - Advances in Neural …, 2024 - proceedings.neurips.cc
Annotating medical images, particularly for organ segmentation, is laborious and time-
consuming. For example, annotating an abdominal organ requires an estimated rate of 30 …

Foundation model for advancing healthcare: Challenges, opportunities, and future directions

Y He, F Huang, X Jiang, Y Nie, M Wang, J Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation model, which is pre-trained on broad data and is able to adapt to a wide range
of tasks, is advancing healthcare. It promotes the development of healthcare artificial …

Versatile medical image segmentation learned from multi-source datasets via model self-disambiguation

X Chen, H Zheng, Y Li, Y Ma, L Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
A versatile medical image segmentation model applicable to images acquired with diverse
equipment and protocols can facilitate model deployment and maintenance. However …

Large window-based mamba unet for medical image segmentation: Beyond convolution and self-attention

J Wang, J Chen, D Chen, J Wu - arXiv preprint arXiv:2403.07332, 2024 - arxiv.org
In clinical practice, medical image segmentation provides useful information on the contours
and dimensions of target organs or tissues, facilitating improved diagnosis, analysis, and …

Sa-med2d-20m dataset: Segment anything in 2d medical imaging with 20 million masks

J Ye, J Cheng, J Chen, Z Deng, T Li, H Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Segment Anything Model (SAM) has achieved impressive results for natural image
segmentation with input prompts such as points and bounding boxes. Its success largely …

AbdomenAtlas: A large-scale, detailed-annotated, & multi-center dataset for efficient transfer learning and open algorithmic benchmarking

W Li, C Qu, X Chen, PRAS Bassi, Y Shi, Y Lai… - Medical Image …, 2024 - Elsevier
We introduce the largest abdominal CT dataset (termed AbdomenAtlas) of 20,460 three-
dimensional CT volumes sourced from 112 hospitals across diverse populations …

Gmai-mmbench: A comprehensive multimodal evaluation benchmark towards general medical ai

P Chen, J Ye, G Wang, Y Li, Z Deng, W Li, T Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Vision-Language Models (LVLMs) are capable of handling diverse data types such as
imaging, text, and physiological signals, and can be applied in various fields. In the medical …