作者
Bocen Li, Guozhen Li, Haiting Wang, Lijun Wang, Zhenfei Gong, Xiaohua Zhang, Huchuan Lu
发表日期
2021
研讨会论文
Pattern Recognition and Computer Vision: 4th Chinese Conference, PRCV 2021, Beijing, China, October 29–November 1, 2021, Proceedings, Part IV 4
页码范围
263-275
出版商
Springer International Publishing
简介
In contrast to traditional transformer blocks using a set of pre-defined parameters as positional embeddings, we propose the input-aware positional embedding (IPE) which is dynamically generated according to the input feature. We implement this idea by designing the IPE transformer, which enjoys stronger generalization powers across arbitrary input sizes. To verify its effectiveness, we integrate the newly-designed transformer into NLSPN and GuideNet, two remarkable depth completion networks. The experimental result on a large scale outdoor depth completion dataset shows that the proposed transformer can effectively model long-range dependency with a manageable memory overhead.
引用总数
学术搜索中的文章
B Li, G Li, H Wang, L Wang, Z Gong, X Zhang, H Lu - Pattern Recognition and Computer Vision: 4th Chinese …, 2021