Q Li, J Cheng, Y Gao, J Li - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
With the emergence of Vision Transformers, attention-based modules have demonstrated
comparable or superior performance in comparison to CNNs on various vision tasks …