Coarse-to-fine feature mining for video semantic segmentation

G Sun, Y Liu, H Ding, T Probst… - proceedings of the …, 2022 - openaccess.thecvf.com
The contextual information plays a core role in semantic segmentation. As for video
semantic segmentation, the contexts include static contexts and motional contexts …

Isomer: Isomerous transformer for zero-shot video object segmentation

Y Yuan, Y Wang, L Wang, X Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent leading zero-shot video object segmentation (ZVOS) works devote to integrating
appearance and motion information by elaborately designing feature fusion modules and …

Multispectral video semantic segmentation: A benchmark dataset and baseline

W Ji, J Li, C Bian, Z Zhou, J Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …

Neural video depth stabilizer

Y Wang, M Shi, J Li, Z Huang, Z Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video depth estimation aims to infer temporally consistent depth. Some methods achieve
temporal consistency by finetuning a single-image depth model during test time using …

Mining relations among cross-frame affinities for video semantic segmentation

G Sun, Y Liu, H Tang, A Chhatkuli, L Zhang… - … on Computer Vision, 2022 - Springer
The essence of video semantic segmentation (VSS) is how to leverage temporal information
for prediction. Previous efforts are mainly devoted to developing new techniques to calculate …

Mask propagation for efficient video semantic segmentation

Y Weng, M Han, H He, M Li, L Yao… - Advances in …, 2024 - proceedings.neurips.cc
Abstract Video Semantic Segmentation (VSS) involves assigning a semantic label to each
pixel in a video sequence. Prior work in this field has demonstrated promising results by …

Latency matters: Real-time action forecasting transformer

H Girase, N Agarwal, C Choi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …

Combining implicit-explicit view correlation for light field semantic segmentation

R Cong, D Yang, R Chen, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Since light field simultaneously records spatial information and angular information of light
rays, it is considered to be beneficial for many potential applications, and semantic …

Vanishing-point-guided video semantic segmentation of driving scenes

D Guo, DP Fan, T Lu, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com
The estimation of implicit cross-frame correspondences and the high computational cost
have long been major challenges in video semantic segmentation (VSS) for driving scenes …

Motion-state Alignment for Video Semantic Segmentation

J Su, R Yin, S Zhang, J Luo - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In recent years, video semantic segmentation has made great progress with advanced deep
neural networks. However, there still exist two main challenges ie, information inconsistency …