SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving

Y Cui, C Han, D Liu - arXiv preprint arXiv:2405.18857, 2024 - arxiv.org
Visual-based perception is the key module for autonomous driving. Among those visual
perception tasks, video object detection is a primary yet challenging one because of feature …

Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving

Y Cui, C Han, D Liu - Journal on Autonomous Transportation Systems, 2024 - dl.acm.org
Visual-based perception is the key module for autonomous driving. Among those visual
perception tasks, video object detection is a primary yet challenging one because of feature …

Video representation learning through prediction for online object detection

M Fujitake, A Sugimoto - Proceedings of the IEEE/CVF …, 2022 - openaccess.thecvf.com
We present a video representation learning framework for real-time video object detection.
Our approach is based on the interesting observation that a powerful prior knowledge of …

Memory enhanced global-local aggregation for video object detection

Y Chen, Y Cao, H Hu, L Wang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
How do humans recognize an object in a piece of video? Due to the deteriorated quality of
single frame, it may be hard for people to identify an occluded object in this frame by just …

Relation-Guided Multi-stage Feature Aggregation Network for Video Object Detection

T Yao, F Cao, F Mi, D Li - Chinese Conference on Pattern Recognition and …, 2023 - Springer
Video object detection task has received extensive research attention and various methods
have been proposed. The quality of single frame in the original video is usually deteriorated …

[PDF][PDF] STF: Spatio-Temporal Fusion Module for Improving Video Object Detection

N Anwar, GA Bilodeau… - Proceedings of the …, 2024 - assets.pubpub.org
Consecutive frames in a video contain redundancy, but they may also contain relevant
complementary information for the detection task. The objective of our work is to leverage …

Object-aware feature aggregation for video object detection

Q Geng, H Zhang, N Jiang, X Qi, L Zhang… - arXiv preprint arXiv …, 2020 - arxiv.org
We present an Object-aware Feature Aggregation (OFA) module for video object detection
(VID). Our approach is motivated by the intriguing property that video-level object-aware …

Progressive sparse local attention for video object detection

C Guo, B Fan, J Gu, Q Zhang, S Xiang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Transferring image-based object detectors to the domain of videos remains a challenging
problem. Previous efforts mostly exploit optical flow to propagate features across frames …

Identity-Consistent Aggregation for Video Object Detection

C Deng, D Chen, Q Wu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Abstract In Video Object Detection (VID), a common practice is to leverage the rich temporal
contexts from the video to enhance the object representations in each frame. Existing …

Class-aware dual-supervised aggregation network for video object detection

Q Qi, Y Yan, H Wang - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Video object detection has attracted increasing attention in recent years. Although great
success has been achieved by off-the-shelf video object detection methods through …