Non-local aggregation for RGB-D semantic segmentation

G Zhang, JH Xue, P Xie, S Yang… - IEEE Signal Processing …, 2021 - ieeexplore.ieee.org
G Zhang, JH Xue, P Xie, S Yang, G Wang
IEEE Signal Processing Letters, 2021ieeexplore.ieee.org
Exploiting both RGB (2D appearance) and Depth (3D geometry) information can improve
the performance of semantic segmentation. However, due to the inherent difference
between the RGB and Depth information, it remains a challenging problem in how to
integrate RGB-D features effectively. In this letter, to address this issue, we propose a Non-
local Aggregation Network (NANet), with a well-designed Multi-modality Non-local
Aggregation Module (MNAM), to better exploit the non-local context of RGB-D features at …
Exploiting both RGB (2D appearance) and Depth (3D geometry) information can improve the performance of semantic segmentation. However, due to the inherent difference between the RGB and Depth information, it remains a challenging problem in how to integrate RGB-D features effectively. In this letter, to address this issue, we propose a Non-local Aggregation Network (NANet), with a well-designed Multi-modality Non-local Aggregation Module (MNAM), to better exploit the non-local context of RGB-D features at multi-stage. Compared with most existing RGB-D semantic segmentation schemes, which only exploit local RGB-D features, the MNAM enables the aggregation of non-local RGB-D information along both spatial and channel dimensions. The proposed NANet achieves comparable performances with state-of-the-art methods on popular RGB-D benchmarks, NYUDv2 and SUN-RGBD.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果