A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Dynamic 3d gaussians: Tracking by persistent dynamic view synthesis

J Luiten, G Kopanas, B Leibe… - … Conference on 3D …, 2024 - ieeexplore.ieee.org
We present a method that simultaneously addresses the tasks of dynamic scene novel-view
synthesis and six degree-of-freedom (6-DOF) tracking of all dense scene elements. We …

Transforming model prediction for tracking

C Mayer, M Danelljan, G Bhat, M Paul… - Proceedings of the …, 2022 - openaccess.thecvf.com
Optimization based tracking methods have been widely successful by integrating a target
model prediction module, providing effective global reasoning by minimizing an objective …

Visual prompt multi-modal tracking

J Zhu, S Lai, X Chen, D Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …

Track anything: Segment anything meets videos

J Yang, M Gao, Z Li, S Gao, F Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, the Segment Anything Model (SAM) gains lots of attention rapidly due to its
impressive segmentation performance on images. Regarding its strong ability on image …

[HTML][HTML] Multi-camera multi-object tracking: a review of current trends and future advances

TI Amosa, P Sebastian, LI Izhar, O Ibrahim, LS Ayinla… - Neurocomputing, 2023 - Elsevier
The nascent applicability of multi-camera tracking (MCT) in numerous real-world
applications makes it a significant computer vision problem. While visual tracking of objects …

Single-model and any-modality for video object tracking

Z Wu, J Zheng, X Ren, FA Vasluianu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the realm of video object tracking auxiliary modalities such as depth thermal or event data
have emerged as valuable assets to complement the RGB trackers. In practice most existing …

Helping hands: An object-aware ego-centric video recognition model

C Zhang, A Gupta, A Zisserman - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We introduce an object-aware decoder for improving the performance of spatio-temporal
representations on ego-centric videos. The key idea is to enhance object-awareness during …

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modility tracking tasks can be divided …