Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Clip in medical imaging: A comprehensive survey

Z Zhao, Y Liu, H Wu, M Wang, Y Li, S Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Contrastive Language-Image Pre-training (CLIP), a simple yet effective pre-training
paradigm, successfully introduces text supervision to vision models. It has shown promising …

H-ViT: A Hierarchical Vision Transformer for Deformable Image Registration

M Ghahremani, M Khateri, B Jian… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper introduces a novel top-down representation approach for deformable image
registration which estimates the deformation field by capturing various short-and long-range …

Enhancing lesion detection in automated breast ultrasound using unsupervised multi-view contrastive learning with 3D DETR

X Tao, Y Cao, Y Jiang, X Wu, D Yan, W Xue… - Medical Image …, 2025 - Elsevier
The inherent variability of lesions poses challenges in leveraging AI in 3D automated breast
ultrasound (ABUS) for lesion detection. Traditional methods based on single scans have …

DeViDe: Faceted medical knowledge for improved medical vision-language pre-training

H Luo, Z Zhou, C Royer, A Sekuboyina… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-language pre-training for chest X-rays has made significant strides, primarily by
utilizing paired radiographs and radiology reports. However, existing approaches often face …

Deep Learning-Based Detect-Then-Track Pipeline for Treatment Outcome Assessments in Immunotherapy-Treated Liver Cancer

J Zhou, Y Xia, X Xun, Z Yu - Journal of Imaging Informatics in Medicine, 2024 - Springer
Accurate treatment outcome assessment is crucial in clinical trials. However, due to the
image-reading subjectivity, there exist discrepancies among different radiologists. The …

Organ-DETR: 3D Organ Detection Transfomer with Multiscale Attention and Dense Query Matching

M GHAHREMANI, BR Ernhofer, J Wang, C Wachinger - openreview.net
Query-based Transformers have been yielding impressive results in object detection. The
potential of DETR-like methods for 3D data, especially in volumetric medical imaging …

[PDF][PDF] Advances in Medical Image Analysis with Vision Transformers: A Comprehensive

R Azad, A Kazerouni, M Heidari, EK Aghdam, A Molaei… - academia.edu
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …