Robust online video instance segmentation with track queries

T Zhang, X Tian, Y Wu, S Ji, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video instance segmentation (VIS) is a critical task with diverse applications, including
autonomous driving and video editing. Existing methods often underperform on complex …

Cited by 51 · Related articles · All 7 versions

[PDF] arxiv.org

Segment anything meets point tracking

F Rajič, L Ke, YW Tai, CK Tang, M Danelljan… - arXiv preprint arXiv …, 2023 - arxiv.org

The Segment Anything Model (SAM) has established itself as a powerful zero-shot image
segmentation model, employing interactive prompts such as points to generate masks. This …

Cited by 62 · Related articles · All 2 versions

[PDF] thecvf.com

TCOVIS: Temporally consistent online video instance segmentation

J Li, B Yu, Y Rao, J Zhou, J Lu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In recent years, significant progress has been made in video instance segmentation (VIS),
with many offline and online methods achieving state-of-the-art performance. While offline …

Cited by 17 · Related articles · All 5 versions

[PDF] arxiv.org

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation

Z Cheng, K Li, H Li, P Jin, C Liu, X Zheng, R Ji… - arXiv preprint arXiv …, 2024 - arxiv.org

Temporally locating objects with arbitrary class texts is the primary pursuit of open-
vocabulary Video Instance Segmentation (VIS). Because of the insufficient vocabulary of …

Cited by 2 · Related articles · All 2 versions

[PDF] arxiv.org

GRAtt-VIS: Gated residual attention for auto rectifying video instance segmentation

T Hannan, R Koner, M Bernhard, S Shit… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online
methods to model complex and lengthy video sequences. However, the degradation of …

Cited by 7 · Related articles · All 2 versions

Learning Better Video Query with SAM for Video Instance Segmentation

H Fang, T Zhang, X Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Recently, Transformer-based offline video instance segmentation (VIS) solutions have made
significant progress by decomposing the whole task into global segmentation map …

Cited by 7 · Related articles

[PDF] arxiv.org

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

T Zhou, W Luo, Q Ye, Z Shi, J Chen - arXiv preprint arXiv:2403.04194, 2024 - arxiv.org

Recently, promptable segmentation models, such as the Segment Anything Model (SAM),
have demonstrated robust zero-shot generalization capabilities on static images. These …

Cited by 2 · Related articles · All 2 versions

[PDF] bmvc2023.org

[PDF] End-to-end Amodal Video Instance Segmentation

J Breitenstein, K Jin, A Hakiri… - BMVC …, 2023 - workshops.proceedings.bmvc2023 …

Amodal perception is the important ability of humans to imagine the entire shape of
occluded objects. This ability is crucial for safety-relevant perception tasks such as …

Cited by 2 · Related articles

GRAtt-VIS: Gated Residual Attention for Video Instance Segmentation

T Hannan, R Koner, M Bernhard, S Shit… - … Conference on Pattern …, 2024 - Springer

Video Instance Segmentation (VIS) has seen a growing reliance on query
propagation-based methods to model complex and lengthy videos. While these methods …
