SwinNet: Swin transformer drives edge-aware RGB-D and RGB-T salient object detection

Z Liu, Y Tan, Q He, Y Xiao - … on Circuits and Systems for Video …, 2021 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are good at extracting contexture features within
certain receptive fields, while transformers can model the global long-range dependency …

CGINet: Cross-modality grade interaction network for RGB-T crowd counting

Y Pan, W Zhou, X Qian, S Mao, R Yang, L Yu - Engineering Applications of …, 2023 - Elsevier
Crowd counting is a fundamental and challenging task that requires rich information to
generate a pixel-level crowd density map. Additionally, the development of thermal sensing …

CATNet: A cascaded and aggregated transformer network for RGB-D salient object detection

F Sun, P Ren, B Yin, F Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Salient object detection (SOD) is an important preprocessing operation for various computer
vision tasks. Most of existing RGB-D SOD models employ additive or connected strategies to …

Embedded control gate fusion and attention residual learning for RGB–thermal urban scene parsing

W Zhou, Y Lv, J Lei, L Yu - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org
The semantic segmentation of road scenes is an important task in autonomous driving.
Deep learning has enabled the development of a variety of semantic segmentation networks …

Modality-induced transfer-fusion network for RGB-D and RGB-T salient object detection

G Chen, F Shao, X Chai, H Chen… - … on Circuits and …, 2022 - ieeexplore.ieee.org
The ability of capturing the complementary information of multi-modality data is critical to the
development of multi-modality salient object detection (SOD). Most of existing studies …

Boundary-guided network for camouflaged object detection

T Chen, J Xiao, X Hu, G Zhang, S Wang - Knowledge-based systems, 2022 - Elsevier
Compared with the traditional object segmentation/detection, camouflaged object detection
is much more difficult due to the indefinable boundaries and high intrinsic similarities …

Mvsalnet: Multi-view augmentation for rgb-d salient object detection

J Zhou, L Wang, H Lu, K Huang, X Shi, B Liu - European Conference on …, 2022 - Springer
RGB-D salient object detection (SOD) enjoys significant advantages in understanding 3D
geometry of the scene. However, the geometry information conveyed by depth maps are …

VP-Net: Voxels as points for 3-D object detection

Z Song, H Wei, C Jia, Y Xia, X Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The 3-D object detection with light detection and ranging (LiDAR) point clouds is a
challenging problem, which requires 3-D scene understanding, yet this task is critical to …

DVSOD: RGB-D video salient object detection

J Li, W Ji, S Wang, W Li - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Salient object detection (SOD) aims to identify standout elements in a scene, with recent
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …

PGDENet: Progressive guided fusion and depth enhancement network for RGB-D indoor scene parsing

W Zhou, E Yang, J Lei, J Wan… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Scene parsing is a fundamental task in computer vision. Various RGB-D (color and depth)
scene parsing methods based on fully convolutional networks have achieved excellent …