Video object detection with an aligned spatial-temporal memory

F Xiao, YJ Lee - … of the European Conference on Computer …, 2018 - openaccess.thecvf.com
Abstract We introduce Spatial-Temporal Memory Networks for video object detection. At its
core, a novel Spatial-Temporal Memory module (STMM) serves as the recurrent …

Ptseformer: Progressive temporal-spatial enhanced transformer towards video object detection

H Wang, J Tang, X Liu, S Guan, R Xie… - European Conference on …, 2022 - Springer
Recent years have witnessed a trend of applying context frames to boost the performance of
object detection as video object detection. Existing methods usually aggregate features at …

Flow-guided feature aggregation for video object detection

X Zhu, Y Wang, J Dai, L Yuan… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Extending state-of-the-art object detectors from image to video is challenging. The accuracy
of detection suffers from degenerated object appearances in videos, eg, motion blur, video …

Leveraging long-range temporal relationships between proposals for video object detection

M Shvets, W Liu, AC Berg - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Single-frame object detectors perform well on videos sometimes, even without temporal
context. However, challenges such as occlusion, motion blur, and rare poses of objects are …

Mamba: Multi-level aggregation via memory bank for video object detection

G Sun, Y Hua, G Hu, N Robertson - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
State-of-the-art video object detection methods maintain a memory structure, either a sliding
window or a memory queue, to enhance the current frame using attention mechanisms …

End-to-end video object detection with spatial-temporal transformers

L He, Q Zhou, X Li, L Niu, G Cheng, X Li, W Liu… - Proceedings of the 29th …, 2021 - dl.acm.org
Recently, DETR and Deformable DETR have been proposed to eliminate the need for many
hand-designed components in object detection while demonstrating good performance as …

Centernet heatmap propagation for real-time video object detection

Z Xu, E Hrustic, D Vivet - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
The existing methods for video object detection mainly depend on two-stage image object
detectors. The fact that two-stage detectors are generally slow makes it difficult to apply in …

Progressive sparse local attention for video object detection

C Guo, B Fan, J Gu, Q Zhang, S Xiang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Transferring image-based object detectors to the domain of videos remains a challenging
problem. Previous efforts mostly exploit optical flow to propagate features across frames …

Object guided external memory network for video object detection

H Deng, Y Hua, T Song, Z Zhang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Video object detection is more challenging than image object detection because of the
deteriorated frame quality. To enhance the feature representation, state-of-the-art methods …

Learning where to focus for efficient video object detection

Z Jiang, Y Liu, C Yang, J Liu, P Gao, Q Zhang… - Computer Vision–ECCV …, 2020 - Springer
Transferring existing image-based detectors to the video is non-trivial since the quality of
frames is always deteriorated by part occlusion, rare pose, and motion blur. Previous …