A comprehensive review of modern object segmentation approaches

Y Wang, U Ahsan, H Li, M Hagen - Foundations and Trends® …, 2022 - nowpublishers.com
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …

GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation

W Zhou, J Liu, J Lei, L Yu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Semantic segmentation is a fundamental task in computer vision, and it has various
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …

A brief survey on RGB-D semantic segmentation using deep learning

C Wang, C Wang, W Li, H Wang - Displays, 2021 - Elsevier
Semantic segmentation is referred to as a process of linking each pixel in an image to a
class label. With this pragmatic technique, it is possible to recognize different objects in an …

Delivering arbitrary-modal semantic segmentation

J Zhang, R Liu, H Shi, K Yang, S Reiß… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …

DRNet: Dual-stage refinement network with boundary inference for RGB-D semantic segmentation of indoor scenes

E Yang, W Zhou, X Qian, J Lei, L Yu - Engineering Applications of Artificial …, 2023 - Elsevier
Semantic segmentation is a dense pixel prediction task, and its accuracy depends on the
extraction of long-range contextual knowledge and refinement of segmentation boundaries …

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

MTANet: Multitask-aware network with hierarchical multimodal fusion for RGB-T urban scene understanding

W Zhou, S Dong, J Lei, L Yu - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
Understanding urban scenes is a fundamental ability requirement for assisted driving and
autonomous vehicles. Most of the available urban scene understanding methods use red …

BCINet: Bilateral cross-modal interaction network for indoor scene understanding in RGB-D images

W Zhou, Y Yue, M Fang, X Qian, R Yang, L Yu - Information Fusion, 2023 - Elsevier
Depth cue has proven to be useful information in the indoor scene understanding of RGB-D
images for providing a geometric counterpart to RGB representation. However, because of …

FRNet: Feature reconstruction network for RGB-D indoor scene parsing

W Zhou, E Yang, J Lei, L Yu - IEEE Journal of Selected Topics …, 2022 - ieeexplore.ieee.org
We recently demonstrated the remarkable performance of scene parsing, and one of its
aspects was shown to be relevant to performance, namely, generation of multilevel feature …

Combining implicit-explicit view correlation for light field semantic segmentation

R Cong, D Yang, R Chen, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Since light field simultaneously records spatial information and angular information of light
rays, it is considered to be beneficial for many potential applications, and semantic …