Z Zhao, K Hao, X Liu, T Zheng, J Xu, S Cui, C He… - Image and Vision …, 2023 - Elsevier
The visual Transformer model based on self-attention has achieved better performance than
convolutional neural networks in object detection tasks. However, existing visual …