J Li, C Zhu, M Zhao, X Xu, L Zhao, W Cheng, S Liu… - Bioengineering, 2024 - mdpi.com
… [26] proposed Vision Transformer (ViT) model, which achieved state-of-the-art on ImageNet
classification by directly applying Transformers with global self-attention to full-sized images. …