作者
Runwei Guan, Ka Lok Man, Haocheng Zhao, Ruixiao Zhang, Shanliang Yao, Jeremy Smith, Eng Gee Lim, Yutao Yue
发表日期
2023/2
期刊
The Journal of Supercomputing
卷号
79
期号
2
页码范围
2108-2136
出版商
Springer US
简介
CNNs have achieved remarkable image classification and object detection results over the past few years. Due to the locality of the convolution operation, although CNNs can extract rich features of the object itself, they can hardly obtain global context in images. It means the CNN-based network is not a good candidate for detecting objects by utilizing the information of the nearby objects, especially when the partially obscured object is hard to detect. ViTs can get a rich context and dramatically improve the prediction in complex scenes with multi-head self-attention. However, it suffers from long inference time and huge parameters, which leads ViT-based detection network that is hardly be deployed in the real-time detection system. In this paper, firstly, we design a novel plug-and-play attention module called mix attention (MA). MA combines channel, spatial and global contextual attention together. It enhances the …
引用总数
学术搜索中的文章
R Guan, KL Man, H Zhao, R Zhang, S Yao, J Smith… - The Journal of Supercomputing, 2023