A review of video object detection: Datasets, metrics and methods

H Zhu, H Wei, B Li, X Yuan, N Kehtarnavaz - Applied Sciences, 2020 - mdpi.com
Although there are well established object detection methods based on static images, their
application to video data on a frame by frame basis faces two shortcomings:(i) lack of …

Bevdet4d: Exploit temporal cues in multi-camera 3d object detection

J Huang, G Huang - arXiv preprint arXiv:2203.17054, 2022 - arxiv.org
Single frame data contains finite information which limits the performance of the existing
vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the …

A comparative analysis of object detection metrics with a companion open-source toolkit

R Padilla, WL Passos, TLB Dias, SL Netto… - Electronics, 2021 - mdpi.com
Recent outstanding results of supervised object detection in competitions and challenges
are often associated with specific metrics and datasets. The evaluation of such methods …

Detection and tracking meet drones challenge

P Zhu, L Wen, D Du, X Bian, H Fan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Drones, or general UAVs, equipped with cameras have been fast deployed with a wide
range of applications, including agriculture, aerial photography, and surveillance …

Tf-blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Video objection detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …

Memory enhanced global-local aggregation for video object detection

Y Chen, Y Cao, H Hu, L Wang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
How do humans recognize an object in a piece of video? Due to the deteriorated quality of
single frame, it may be hard for people to identify an occluded object in this frame by just …

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

Retinatrack: Online single stage joint detection and tracking

Z Lu, V Rathod, R Votel… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Traditionally multi-object tracking and object detection are performed using separate
systems with most prior works focusing exclusively on one of these aspects over the other …

End-to-end video object detection with spatial-temporal transformers

L He, Q Zhou, X Li, L Niu, G Cheng, X Li, W Liu… - Proceedings of the 29th …, 2021 - dl.acm.org
Recently, DETR and Deformable DETR have been proposed to eliminate the need for many
hand-designed components in object detection while demonstrating good performance as …

3d-man: 3d multi-frame attention network for object detection

Z Yang, Y Zhou, Z Chen… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract 3D object detection is an important module in autonomous driving and robotics.
However, many existing methods focus on using single frames to perform 3D detection, and …