Dual semantic fusion network for video object detection

[HTML][HTML] AI-based object detection latest trends in remote sensing, multimedia and agriculture applications

SA Nawaz, J Li, UA Bhatti, MU Shoukat… - Frontiers in Plant …, 2022 - frontiersin.org

Object detection is a vital research direction in machine vision and deep learning. The object
detection technique based on deep understanding has achieved tremendous progress in …

被引用次数：34 相关文章所有 6 个版本

[PDF] arxiv.org

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

被引用次数：93 相关文章所有 12 个版本

[PDF] thecvf.com

Crossover learning for fast online video instance segmentation

S Yang, Y Fang, X Wang, Y Li, C Fang… - Proceedings of the …, 2021 - openaccess.thecvf.com

Modeling temporal visual context across frames is critical for video instance segmentation
(VIS) and other video understanding tasks. In this paper, we propose a fast online VIS model …

被引用次数：111 相关文章所有 8 个版本

[PDF] arxiv.org

End-to-end video object detection with spatial-temporal transformers

L He, Q Zhou, X Li, L Niu, G Cheng, X Li, W Liu… - Proceedings of the 29th …, 2021 - dl.acm.org

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many
hand-designed components in object detection while demonstrating good performance as …

被引用次数：89 相关文章所有 3 个版本

[PDF] arxiv.org

Hero: Hierarchical spatio-temporal reasoning with contrastive action correspondence for end-to-end video object grounding

M Li, T Wang, H Zhang, S Zhang, Z Zhao… - Proceedings of the 30th …, 2022 - dl.acm.org

Video Object Grounding (VOG) is the problem of associating spatial object regions in the
video to a descriptive natural language query. This is a challenging vision-language task …

被引用次数：25 相关文章所有 3 个版本

[PDF] arxiv.org

Damo-streamnet: Optimizing streaming perception in autonomous driving

JY He, ZQ Cheng, C Li, W Xiang, B Chen, B Luo… - arXiv preprint arXiv …, 2023 - arxiv.org

Real-time perception, or streaming perception, is a crucial aspect of autonomous driving that
has yet to be thoroughly explored in existing research. To address this gap, we present …

被引用次数：10 相关文章所有 6 个版本

Multilevel spatial-temporal feature aggregation for video object detection

C Xu, J Zhang, M Wang, G Tian… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Video object detection (VOD) focuses on detecting objects for each frame in a video, which
is a challenging task due to appearance deterioration in certain video frames. Recent works …

被引用次数：12 相关文章所有 3 个版本

Joint spatio-temporal similarity and discrimination learning for visual tracking

Y Liang, H Chen, Q Wu, C Xia… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Visual tracking is a task of localizing a target unceasingly in a video with an initial target
state at the first frame. The limited target information makes this problem an extremely …

被引用次数：3 相关文章

[PDF] thecvf.com

Identity-Consistent Aggregation for Video Object Detection

C Deng, D Chen, Q Wu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Abstract In Video Object Detection (VID), a common practice is to leverage the rich temporal
contexts from the video to enhance the object representations in each frame. Existing …

FastVOD-Net: A real-time and high-accuracy video object detector

Q Qi, X Wang, T Hou, Y Yan… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Video object detection is a tough task due to the severe appearance degradation caused by
rapid motion, sudden occlusion or rare poses. The great challenge facing video object …

被引用次数：8 相关文章所有 2 个版本

高级搜索

QQ 群