[HTML][HTML] Review of large vision models and visual prompt engineering

J Wang, Z Liu, L Zhao, Z Wu, C Ma, S Yu, H Dai… - Meta-Radiology, 2023 - Elsevier
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …

A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

Ma-sam: Modality-agnostic sam adaptation for 3d medical image segmentation

C Chen, J Miao, D Wu, A Zhong, Z Yan, S Kim… - Medical Image …, 2024 - Elsevier
Abstract The Segment Anything Model (SAM), a foundation model for general image
segmentation, has demonstrated impressive zero-shot performance across numerous …

Segment anything, from space?

S Ren, F Luzi, S Lahrichi, K Kassaw… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently, the first foundation model developed specifically for image segmentation tasks
was developed, termed the" Segment Anything Model"(SAM). SAM can segment objects in …

Surgicalsam: Efficient class promptable surgical instrument segmentation

W Yue, J Zhang, K Hu, Y Xia, J Luo… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The Segment Anything Model (SAM) is a powerful foundation model that has revolutionised
image segmentation. To apply SAM to surgical instrument segmentation, a common …

Segment anything model for medical image segmentation: Current applications and future directions

Y Zhang, Z Shen, R Jiao - Computers in Biology and Medicine, 2024 - Elsevier
Due to the inherent flexibility of prompting, foundation models have emerged as the
predominant force in the fields of natural language processing and computer vision. The …

Annotation-free audio-visual segmentation

J Liu, Y Wang, C Ju, C Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract The objective of Audio-Visual Segmentation (AVS) is to localise the sounding
objects within visual scenes by accurately predicting pixel-wise segmentation masks. To …

Deep interactive segmentation of medical images: A systematic review and taxonomy

Z Marinov, PF Jäger, J Egger, J Kleesiek… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Interactive segmentation is a crucial research area in medical image analysis aiming to
boost the efficiency of costly annotations by incorporating human feedback. This feedback …

Unleashing the potential of SAM for medical adaptation via hierarchical decoding

Z Cheng, Q Wei, H Zhu, Y Wang, L Qu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract The Segment Anything Model (SAM) has garnered significant attention for its
versatile segmentation abilities and intuitive prompt-based interface. However its application …

[HTML][HTML] Surgical-DINO: adapter learning of foundation models for depth estimation in endoscopic surgery

B Cui, M Islam, L Bai, H Ren - International Journal of Computer Assisted …, 2024 - Springer
Purpose Depth estimation in robotic surgery is vital in 3D reconstruction, surgical navigation
and augmented reality visualization. Although the foundation model exhibits outstanding …