Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Tracking anything with decoupled video segmentation

HK Cheng, SW Oh, B Price… - Proceedings of the …, 2023 - openaccess.thecvf.com
Training data for video segmentation are expensive to annotate. This impedes extensions of
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …

A generalist framework for panoptic segmentation of images and videos

T Chen, L Li, S Saxena, G Hinton… - Proceedings of the …, 2023 - openaccess.thecvf.com
Panoptic segmentation assigns semantic and instance ID labels to every pixel of an image.
As permutations of instance IDs are also valid solutions, the task requires learning of high …

Coarse-to-fine video instance segmentation with factorized conditional appearance flows

Z Qin, X Lu, X Nie, D Liu, Y Yin, W Wang - IEEE/CAA Journal of …, 2023 - ieee-jas.net
We introduce a novel method using a new generative model that automatically learns
effective representations of the target and background appearance to detect, segment and …

In defense of online models for video instance segmentation

J Wu, Q Liu, Y Jiang, S Bai, A Yuille, X Bai - European Conference on …, 2022 - Springer
In recent years, video instance segmentation (VIS) has been largely advanced by offline
models, while online models gradually attracted less attention possibly due to their inferior …

Language as queries for referring video object segmentation

J Wu, Y Jiang, P Sun, Z Yuan… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

Video K-Net: A simple, strong, and unified baseline for video segmentation

X Li, W Zhang, J Pang, K Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper presents Video K-Net, a simple, strong, and unified framework for fully
end-to-end video panoptic segmentation. The method is built upon K-Net, a method that unifies …

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

SeqFormer: Sequential transformer for video instance segmentation

J Wu, Y Jiang, S Bai, W Zhang, X Bai - European Conference on Computer …, 2022 - Springer
In this work, we present SeqFormer for video instance segmentation. SeqFormer follows the
principle of vision transformer that models instance relationships among video frames …