Temporal context enhanced feature aggregation for video object detection

F He, N Gao, Q Li, S Du, X Zhao, K Huang - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Video object detection is a challenging task because of the presence of appearance
deterioration in certain video frames. One typical solution is to aggregate neighboring …

Mamba: Multi-level aggregation via memory bank for video object detection

G Sun, Y Hua, G Hu, N Robertson - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
State-of-the-art video object detection methods maintain a memory structure, either a sliding
window or a memory queue, to enhance the current frame using attention mechanisms …

Exploiting better feature aggregation for video object detection

L Han, P Wang, Z Yin, F Wang, H Li - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Video object detection (VOD) has been a rising topic in recent years due to the challenges
such as occlusion, motion blur, etc. To deal with these challenges, feature aggregation from …

Ptseformer: Progressive temporal-spatial enhanced transformer towards video object detection

H Wang, J Tang, X Liu, S Guan, R Xie… - European Conference on …, 2022 - Springer
Recent years have witnessed a trend of applying context frames to boost the performance of
object detection as video object detection. Existing methods usually aggregate features at …

Object detection in video with spatial-temporal context aggregation

H Luo, L Huang, H Shen, Y Li, C Huang… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent cutting-edge feature aggregation paradigms for video object detection rely on
inferring feature correspondence. The feature correspondence estimation problem is …

Dynamic feature aggregation for efficient video object detection

Y Cui - Proceedings of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
Video object detection is a fundamental yet challenging task in computer vision. One
practical solution is to take advantage of temporal information from the video and apply …

Video object detection via object-level temporal aggregation

CH Yao, C Fang, X Shen, Y Wan, MH Yang - Computer Vision–ECCV …, 2020 - Springer
While single-image object detectors can be naively applied to videos in a frame-by-frame
fashion, the prediction is often temporally inconsistent. Moreover, the computation can be …

Video object detection with an aligned spatial-temporal memory

F Xiao, YJ Lee - … of the European conference on computer …, 2018 - openaccess.thecvf.com
Abstract We introduce Spatial-Temporal Memory Networks for video object detection. At its
core, a novel Spatial-Temporal Memory module (STMM) serves as the recurrent …

Dgrnet: A dual-level graph relation network for video object detection

Q Qi, T Hou, Y Lu, Y Yan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Video object detection is a fundamental and important task in computer vision. One mainstay
solution for this task is to aggregate features from different frames to enhance the detection …

Class-aware dual-supervised aggregation network for video object detection

Q Qi, Y Yan, H Wang - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Video object detection has attracted increasing attention in recent years. Although great
success has been achieved by off-the-shelf video object detection methods through …