New generation deep learning for video object detection: A survey

L Jiao, R Zhang, F Liu, S Yang, B Hou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Video object detection, a basic task in the computer vision field, is rapidly evolving and
widely used. In recent years, deep learning methods have rapidly become widespread in the …

A review of video object detection: Datasets, metrics and methods

H Zhu, H Wei, B Li, X Yuan, N Kehtarnavaz - Applied Sciences, 2020 - mdpi.com
Although there are well established object detection methods based on static images, their
application to video data on a frame by frame basis faces two shortcomings:(i) lack of …

Bevdet4d: Exploit temporal cues in multi-camera 3d object detection

J Huang, G Huang - arXiv preprint arXiv:2203.17054, 2022 - arxiv.org
Single frame data contains finite information which limits the performance of the existing
vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the …

Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM

PN Srinivasu, JG SivaSai, MF Ijaz, AK Bhoi, W Kim… - Sensors, 2021 - mdpi.com
Deep learning models are efficient in learning the features that assist in understanding
complex patterns precisely. This study proposed a computerized process of classifying skin …

Deep learning for unmanned aerial vehicle-based object detection and tracking: A survey

X Wu, W Li, D Hong, R Tao, Q Du - IEEE Geoscience and …, 2021 - ieeexplore.ieee.org
Owing to effective and flexible data acquisition, unmanned aerial vehicles (UAVs) have
recently become a hotspot across the fields of computer vision (CV) and remote sensing …

TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

RegNet: Self-regulated network for image classification

J Xu, Y Pan, X Pan, S Hoi, Z Yi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The ResNet and its variants have achieved remarkable successes in various computer
vision tasks. Despite its success in making gradient flow through building blocks, the …

Flexible high-resolution object detection on edge devices with tunable latency

S Jiang, Z Lin, Y Li, Y Shu, Y Liu - Proceedings of the 27th Annual …, 2021 - dl.acm.org
Object detection is a fundamental building block of video analytics applications. While
Neural Networks (NNs)-based object detection models have shown excellent accuracy on …

Tracking pedestrian heads in dense crowd

R Sundararaman… - Proceedings of the …, 2021 - openaccess.thecvf.com
Tracking humans in crowded video sequences is an important constituent of visual scene
understanding. Increasing crowd density challenges visibility of humans, limiting the …

Interventional video relation detection

Y Li, X Yang, X Shang, TS Chua - Proceedings of the 29th ACM …, 2021 - dl.acm.org
Video Visual Relation Detection (VidVRD) aims to semantically describe the dynamic
interactions across visual concepts localized in a video in the form of subject, predicate …