Objects do not disappear: Video object detection by single-frame object location anticipation

X Liu, FK Nejadasl, JC van Gemert… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Objects in videos are typically characterized by continuous smooth motion. We
exploit continuous smooth motion in three ways. 1) Improved accuracy by using object …

Leveraging long-range temporal relationships between proposals for video object detection

M Shvets, W Liu, AC Berg - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Single-frame object detectors perform well on videos sometimes, even without temporal
context. However, challenges such as occlusion, motion blur, and rare poses of objects are …

Boxmask: Revisiting bounding box supervision for video object detection

KA Hashmi, A Pagani, D Stricker… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new, simple yet effective approach to uplift video object detection. We observe
that prior works operate on instance-level feature aggregation that imminently neglects the …

Video object detection via object-level temporal aggregation

CH Yao, C Fang, X Shen, Y Wan, MH Yang - Computer Vision–ECCV …, 2020 - Springer
While single-image object detectors can be naively applied to videos in a frame-by-frame
fashion, the prediction is often temporally inconsistent. Moreover, the computation can be …

SSVOD: Semi-supervised video object detection with sparse annotations

T Mahmud, CH Liu, B Yaman… - Proceedings of the …, 2024 - openaccess.thecvf.com
Despite significant progress in semi-supervised learning for image object detection, several
key issues are yet to be addressed for video object detection:(1) Achieving good …

Video object detection with locally-weighted deformable neighbors

Z Jiang, P Gao, C Guo, Q Zhang, S Xiang… - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Deep convolutional neural networks have achieved great success on various image
recognition tasks. However, it is nontrivial to transfer the existing networks to video due to …

Identity-Consistent Aggregation for Video Object Detection

C Deng, D Chen, Q Wu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Abstract In Video Object Detection (VID), a common practice is to leverage the rich temporal
contexts from the video to enhance the object representations in each frame. Existing …

Flow-guided feature aggregation for video object detection

X Zhu, Y Wang, J Dai, L Yuan… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Extending state-of-the-art object detectors from image to video is challenging. The accuracy
of detection suffers from degenerated object appearances in videos, eg, motion blur, video …

Video object detection with an aligned spatial-temporal memory

F Xiao, YJ Lee - … of the European conference on computer …, 2018 - openaccess.thecvf.com
Abstract We introduce Spatial-Temporal Memory Networks for video object detection. At its
core, a novel Spatial-Temporal Memory module (STMM) serves as the recurrent …

Semi-supervised dff: Decoupling detection and feature flow for video object detectors

G Han, X Zhang, C Li - Proceedings of the 26th ACM international …, 2018 - dl.acm.org
For efficient video object detection, our detector consists of a spatial module and a temporal
module. The spatial module aims to detect objects in static frames using convolutional …