Practical Video Object Detection via Feature Selection and Aggregation

Y Shi, T Zhang, X Guo - arXiv preprint arXiv:2407.19650, 2024 - arxiv.org
Compared with still image object detection, video object detection (VOD) needs to
particularly concern the high across-frame variation in object appearance, and the diverse …

YOLOV: Making still image object detectors great at video object detection

Y Shi, N Wang, X Guo - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Video object detection (VID) is challenging because of the high variation of object
appearance as well as the diverse deterioration in some frames. On the positive side, the …

Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection

B Zhang, S Wang, Y Liu, B Kusy, X Li, J Liu - Proceedings of the 31st …, 2023 - dl.acm.org
Current video object detection (VOD) models often encounter issues with over-aggregation
due to redundant aggregation strategies, which perform feature aggregation on every frame …

Learning where to focus for efficient video object detection

Z Jiang, Y Liu, C Yang, J Liu, P Gao, Q Zhang… - Computer Vision–ECCV …, 2020 - Springer
Transferring existing image-based detectors to the video is non-trivial since the quality of
frames is always deteriorated by part occlusion, rare pose, and motion blur. Previous …

Efficient one-stage video object detection by exploiting temporal consistency

G Sun, Y Hua, G Hu, N Robertson - European Conference on Computer …, 2022 - Springer
Recently, one-stage detectors have achieved competitive accuracy and faster speed
compared with traditional two-stage detectors on image data. However, in the field of video …

Dynamic feature aggregation for efficient video object detection

Y Cui - Proceedings of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
Video object detection is a fundamental yet challenging task in computer vision. One
practical solution is to take advantage of temporal information from the video and apply …

MAMBA: Multi-level aggregation via memory bank for video object detection

G Sun, Y Hua, G Hu, N Robertson - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
State-of-the-art video object detection methods maintain a memory structure, either a sliding
window or a memory queue, to enhance the current frame using attention mechanisms …

BoxMask: Revisiting bounding box supervision for video object detection

KA Hashmi, A Pagani, D Stricker… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new, simple yet effective approach to uplift video object detection. We observe
that prior works operate on instance-level feature aggregation that imminently neglects the …

Class-aware feature aggregation network for video object detection

L Han, P Wang, Z Yin, F Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Recent progress in video object detection (VOD) has shown that aggregating features from
other frames to capture long-range contextual information is very important to deal with the …

Joint representation of temporal image sequences and object motion for video object detection

J Koh, J Kim, Y Shin, B Lee, S Yang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
In this paper, we propose a new video object detection (VoD) method, referred to as
temporal feature aggregation and motion-aware VoD (TM-VoD), that produces a joint …