Dvis: Decoupled video instance segmentation framework

T Zhang, X Tian, Y Wu, S Ji, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video instance segmentation (VIS) is a critical task with diverse applications, including
autonomous driving and video editing. Existing methods often underperform on complex …

Segment anything meets point tracking

F Rajič, L Ke, YW Tai, CK Tang, M Danelljan… - arXiv preprint arXiv …, 2023 - arxiv.org
The Segment Anything Model (SAM) has established itself as a powerful zero-shot image
segmentation model, employing interactive prompts such as points to generate masks. This …

Tcovis: Temporally consistent online video instance segmentation

J Li, B Yu, Y Rao, J Zhou, J Lu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In recent years, significant progress has been made in video instance segmentation (VIS),
with many offline and online methods achieving state-of-the-art performance. While offline …

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation

Z Cheng, K Li, H Li, P Jin, C Liu, X Zheng, R Ji… - arXiv preprint arXiv …, 2024 - arxiv.org
Temporally locating objects with arbitrary class texts is the primary pursuit of open-
vocabulary Video Instance Segmentation (VIS). Because of the insufficient vocabulary of …

Gratt-vis: Gated residual attention for auto rectifying video instance segmentation

T Hannan, R Koner, M Bernhard, S Shit… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online
methods to model complex and lengthy video sequences. However, the degradation of …

Learning Better Video Query with SAM for Video Instance Segmentation

H Fang, T Zhang, X Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recently, Transformer-based offline video instance segmentation (VIS) solutions have made
significant progress by decomposing the whole task into global segmentation map …

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

T Zhou, W Luo, Q Ye, Z Shi, J Chen - arXiv preprint arXiv:2403.04194, 2024 - arxiv.org
Recently, promptable segmentation models, such as the Segment Anything Model (SAM),
have demonstrated robust zero-shot generalization capabilities on static images. These …

[PDF][PDF] End-to-end Amodal Video Instance Segmentation.

J Breitenstein, K Jin, A Hakiri… - BMVC …, 2023 - workshops.proceedings.bmvc2023 …
Amodal perception is the important ability of humans to imagine the entire shape of
occluded objects. This ability is crucial for safety-relevant perception tasks such as …

GRAtt-VIS: Gated Residual Attention for Video Instance Segmentation

T Hannan, R Koner, M Bernhard, S Shit… - … Conference on Pattern …, 2024 - Springer
Abstract Video Instance Segmentation (VIS) has seen a growing reliance on query
propagation-based methods to model complex and lengthy videos. While these methods …