Generalized relation modeling for transformer tracking

S Gao, C Zhou, J Zhang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which
allows earlier interaction between the template and search region, has achieved a …

The first visual object tracking segmentation VOTS2023 challenge results

M Kristan, J Matas, M Danelljan… - Proceedings of the …, 2023 - openaccess.thecvf.com
The Visual Object Tracking Segmentation VOTS2023 challenge is the eleventh
annual tracker benchmarking activity of the VOT initiative. This challenge is the first to merge …

Separable self and mixed attention transformers for efficient object tracking

GY Gopal, MA Amer - Proceedings of the IEEE/CVF Winter …, 2024 - openaccess.thecvf.com
The deployment of transformers for visual object tracking has shown state-of-the-art results
on several benchmarks. However, the transformer-based models are under-utilized for …

Unifying Visual and Vision-Language Tracking via Contrastive Learning

Y Ma, Y Tang, W Yang, T Zhang, J Zhang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Single object tracking aims to locate the target object in a video sequence according to the
state specified by different modal references, including the initial bounding box (BBOX) …

NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tracking

Y Liu, A Mahmood, MH Khan - Proceedings of the Asian …, 2024 - openaccess.thecvf.com
Many current tracking benchmarks, such as OTB100, Nfs, UAV123, LaSOT, and GOT-10K,
are approaching saturation, i.e., they are approaching their maximum score capacity. As such …

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation

L Hong, Z Liu, W Chen, C Tan, Y Feng, X Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Video object segmentation (VOS) aims to distinguish and track target objects in a video.
Despite the excellent performance achieved by off-the-shelf VOS models, existing VOS …

ASAFormer: Visual tracking with convolutional vision transformer and asymmetric selective attention

X Gong, Y Zhang, S Hu - Knowledge-Based Systems, 2024 - Elsevier
Recently, Vision Transformer (ViT) has exhibited remarkable performance in many
computer vision tasks (e.g., object detection, segmentation and tracking). However, the output …

Discriminative target predictor based on temporal-scene attention context enhancement and candidate matching mechanism

B Cao, X Wu, X Zhang, Y Wang, Z Ma - Expert Systems with Applications, 2024 - Elsevier
Visual tracking is a fundamental task in computer vision, which extracts the target context
descriptions and features from the first image frame and tracks the target in subsequent …

Beyond SOT: Tracking Multiple Generic Objects at Once

C Mayer, M Danelljan, MH Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Generic Object Tracking (GOT) is the problem of tracking target objects, specified by
bounding boxes in the first frame of a video. While the task has received much attention in …

Parameter-Efficient Tuning for Object Tracking by Migrating Pre-Trained Decoders

R Zhang, L Wang, S Yang - Electronics, 2024 - mdpi.com
Video object tracking has taken advantage of pre-trained weights on large-scale datasets.
However, most trackers fully fine-tune all the backbone's parameters for adjusting to tracking …