SDPDet: Learning Scale-Separated Dynamic Proposals for End-to-End Drone-View Detection

N Yin, C Liu, R Tian, X Qian - IEEE Transactions on Multimedia, 2024 - ieeexplore.ieee.org
Detecting objects in large-scale drone-view images is notoriously challenging due to their
uneven distribution and scale variation caused by photoing angles. Common approaches …

PairDETR: Joint Detection and Association of Human Bodies and Faces

A Ali, G Gaikov, D Rybalchenko… - Proceedings of the …, 2024 - openaccess.thecvf.com
Image and video analysis requires not only accurate object but also the understanding of
relationships among detected objects. Common solutions to relation modeling typically …

Generative Region-Language Pretraining for Open-Ended Object Detection

C Lin, Y Jiang, L Qu, Z Yuan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
In recent research significant attention has been devoted to the open-vocabulary object
detection task aiming to generalize beyond the limited number of classes labeled during …

RailFOD23: A dataset for foreign object detection on railroad transmission lines

Z Chen, J Yang, Z Feng, H Zhu - Scientific Data, 2024 - nature.com
Artificial intelligence models play a crucial role in monitoring and maintaining railroad
infrastructure by analyzing image data of foreign objects on power transmission lines …

Hiri-vit: Scaling vision transformer with high resolution inputs

T Yao, Y Li, Y Pan, T Mei - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
The hybrid deep models of Vision Transformer (ViT) and Convolution Neural Network (CNN)
have emerged as a powerful class of backbones for vision tasks. Scaling up the input …

Vehicle detection algorithms for autonomous driving: a review

L Liang, H Ma, L Zhao, X Xie, C Hua, M Zhang… - Sensors, 2024 - mdpi.com
Autonomous driving, as a pivotal technology in modern transportation, is progressively
transforming the modalities of human mobility. In this domain, vehicle detection is a …

Mobileinst: Video instance segmentation on the mobile

R Zhang, T Cheng, S Yang, H Jiang, S Zhang… - Proceedings of the …, 2024 - ojs.aaai.org
Video instance segmentation on mobile devices is an important yet very challenging edge AI
problem. It mainly suffers from (1) heavy computation and memory costs for frame-by-frame …

ISTR: Mask-Embedding-Based Instance Segmentation Transformer

J Hu, Y Lu, S Zhang, L Cao - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Transformer-based instance-level recognition has attracted increasing research attention
recently due to the superior performance. However, although attempts have been made to …

Higher efficient YOLOv7: a one-stage method for non-salient object detection

C Dong, Y Tang, L Zhang - Multimedia Tools and Applications, 2024 - Springer
Compared to the remarkable progress within the discipline of object detection in recent
years, real-time detection of non-salient objects remains a challenging research task …

[HTML][HTML] Benchmarking wild bird detection in complex forest scenes

Q Song, Y Guan, X Guo, X Guo, Y Chen, H Wang… - Ecological …, 2024 - Elsevier
Camera traps are widely used for wildlife monitoring and making informed conservation and
land-management decisions, but the resulting 'big data'are laborious to process. Deep …