作者
Qiming Zhang, Jing Zhang, Yufei Xu, Dacheng Tao
发表日期
2024/1/8
期刊
IEEE Transactions on Pattern Analysis and Machine Intelligence
出版商
IEEE
简介
Window-based attention has become a popular choice in vision transformers due to its superior performance, lower computational complexity, and less memory footprint. However, the design of hand-crafted windows, which is data-agnostic, constrains the flexibility of transformers to adapt to objects of varying sizes, shapes, and orientations. To address this issue, we propose a novel quadrangle attention (QA) method that extends the window-based attention to a general quadrangle formulation. Our method employs an end-to-end learnable quadrangle regression module that predicts a transformation matrix to transform default windows into target quadrangles for token sampling and attention calculation, enabling the network to model various targets with different shapes and orientations and capture rich context information. We integrate QA into plain and hierarchical vision transformers to create a new architecture …
引用总数
学术搜索中的文章
Q Zhang, J Zhang, Y Xu, D Tao - IEEE Transactions on Pattern Analysis and Machine …, 2024