SeqTrack: Sequence to sequence learning for visual object tracking

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

MixFormer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Visual prompt multi-modal tracking

J Zhu, S Lai, X Chen, D Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …

Segment and track anything

Y Cheng, L Li, Y Xu, X Li, Z Yang, W Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
This report presents a framework called Segment And Track Anything (SAM-Track) that
allows users to precisely and effectively segment and track any object in a video …

Track anything: Segment anything meets videos

J Yang, M Gao, Z Li, S Gao, F Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, the Segment Anything Model (SAM) gains lots of attention rapidly due to its
impressive segmentation performance on images. Regarding its strong ability on image …

MixFormerV2: Efficient fully transformer tracking

Y Cui, T Song, G Wu, L Wang - Advances in Neural …, 2024 - proceedings.neurips.cc
Transformer-based trackers have achieved strong accuracy on the standard benchmarks.
However, their efficiency remains an obstacle to practical deployment on both GPU and …

Single-model and any-modality for video object tracking

Z Wu, J Zheng, X Ren, FA Vasluianu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the realm of video object tracking, auxiliary modalities such as depth, thermal, or event data
have emerged as valuable assets to complement the RGB trackers. In practice, most existing …

Integrating boxes and masks: A multi-object framework for unified visual tracking and segmentation

Y Xu, Z Yang, Y Yang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Tracking any given object(s) spatially and temporally is a common purpose in Visual Object
Tracking (VOT) and Video Object Segmentation (VOS). Joint tracking and segmentation …

OneTracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modality, tracking tasks can be divided …

DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking

F Xie, Z Wang, C Ma - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Existing Siamese or transformer trackers commonly pose visual object tracking as a
one-shot detection problem, i.e., locating the target object in a single forward evaluation scheme …