F Shen, Y Xie, J Zhu, X Zhu, H Zeng - arXiv preprint arXiv:2107.05475, 2021 - arxiv.org
Transformers are more and more popular in computer vision, which treat an image as a
sequence of patches and learn robust global features from the sequence. However, pure …