L Jiao, R Zhang, F Liu, S Yang, B Hou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Video object detection, a basic task in the computer vision field, is rapidly evolving and widely used. In recent years, deep learning methods have rapidly become widespread in the …
X Zhu, J Dai, L Yuan, Y Wei - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
There has been significant progress in image object detection recently. Nevertheless, video object detection has received little attention, although it is more challenging and more …
Video object detection is challenging because objects that are easily detected in one frame may be difficult to detect in another frame within the same clip. Recently, there have been …
Z Xu, E Hrustic, D Vivet - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
The existing methods for video object detection mainly depend on two-stage image object detectors. The fact that two-stage detectors are generally slow makes it difficult to apply in …
Transferring existing image-based detectors to video is non-trivial since frame quality is always degraded by partial occlusion, rare poses, and motion blur. Previous …
Y Cui - Proceedings of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
Video object detection is a fundamental yet challenging task in computer vision. One practical solution is to take advantage of temporal information from the video and apply …
M Shvets, W Liu, AC Berg - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Single-frame object detectors sometimes perform well on videos, even without temporal context. However, challenges such as occlusion, motion blur, and rare poses of objects are …
Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Video object detection is a challenging task because isolated video frames may encounter appearance deterioration, which introduces great confusion for detection. One of …
State-of-the-art video object detection methods maintain a memory structure, either a sliding window or a memory queue, to enhance the current frame using attention mechanisms …