Boxmask: Revisiting bounding box supervision for video object detection

KA Hashmi, A Pagani, D Stricker… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new, simple yet effective approach to uplift video object detection. We observe
that prior works operate on instance-level feature aggregation that imminently neglects the …

Objects do not disappear: Video object detection by single-frame object location anticipation

X Liu, FK Nejadasl, JC van Gemert… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Objects in videos are typically characterized by continuous smooth motion. We
exploit continuous smooth motion in three ways. 1) Improved accuracy by using object …

IMC-Det: Intra–Inter Modality Contrastive Learning for Video Object Detection

Q Qi, Z Qiu, Y Yan, Y Lu, H Wang - International Journal of Computer …, 2024 - Springer
Video object detection is an important yet challenging task in the computer vision field. One
limitation of off-the-shelf video object detection methods is that they only explore information …

Mamba: Multi-level aggregation via memory bank for video object detection

G Sun, Y Hua, G Hu, N Robertson - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
State-of-the-art video object detection methods maintain a memory structure, either a sliding
window or a memory queue, to enhance the current frame using attention mechanisms …

Leveraging long-range temporal relationships between proposals for video object detection

M Shvets, W Liu, AC Berg - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Single-frame object detectors perform well on videos sometimes, even without temporal
context. However, challenges such as occlusion, motion blur, and rare poses of objects are …

Practical Video Object Detection via Feature Selection and Aggregation

Y Shi, T Zhang, X Guo - arXiv preprint arXiv:2407.19650, 2024 - arxiv.org
Compared with still image object detection, video object detection (VOD) needs to
particularly concern the high across-frame variation in object appearance, and the diverse …

Efficient one-stage video object detection by exploiting temporal consistency

G Sun, Y Hua, G Hu, N Robertson - European Conference on Computer …, 2022 - Springer
Recently, one-stage detectors have achieved competitive accuracy and faster speed
compared with traditional two-stage detectors on image data. However, in the field of video …

Mining inter-video proposal relations for video object detection

M Han, Y Wang, X Chang, Y Qiao - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Recent studies have shown that, context aggregating information from proposals in different
frames can clearly enhance the performance of video object detection. However, these …

Semi-supervised dff: Decoupling detection and feature flow for video object detectors

G Han, X Zhang, C Li - Proceedings of the 26th ACM international …, 2018 - dl.acm.org
For efficient video object detection, our detector consists of a spatial module and a temporal
module. The spatial module aims to detect objects in static frames using convolutional …

Flow-guided feature aggregation for video object detection

X Zhu, Y Wang, J Dai, L Yuan… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Extending state-of-the-art object detectors from image to video is challenging. The accuracy
of detection suffers from degenerated object appearances in videos, eg, motion blur, video …