Transformers in small object detection: A benchmark and survey of state-of-the-art

AM Rekavandi, S Rashidi, F Boussaid, S Hoefs… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have rapidly gained popularity in computer vision, especially in the field of
object recognition and detection. Upon examining the outcomes of state-of-the-art object …

Video2music: Suitable music generation from videos using an affective multimodal transformer model

J Kang, S Poria, D Herremans - Expert Systems with Applications, 2024 - Elsevier
Numerous studies in the field of music generation have demonstrated impressive
performance, yet virtually no models are able to directly generate music to match …

Lightweight multiobject ship tracking algorithm based on trajectory association and improved YOLOv7tiny

K Hao, Z Deng, B Wang, Z Jin, Z Li, X Zhao - Expert Systems with …, 2025 - Elsevier
In response to various challenges, such as high computational complexity, large parameter
counts, and frequent vessel ID switching in ship tracking models, we propose a trajectory …

A lightweight small object detection algorithm based on improved YOLOv5 for driving scenarios

Z Wen, J Su, Y Zhang, M Li, G Gan, S Zhang… - International Journal of …, 2023 - Springer
Small object detection has been a longstanding challenge in the field of object detection,
and achieving high detection accuracy is crucial for autonomous driving, especially for small …

[HTML][HTML] Enhancing pavement crack segmentation via semantic diffusion synthesis model for strategic road assessment

S Cano-Ortiz, E Sainz-Ortiz, LL Iglesias… - Results in …, 2024 - Elsevier
Computer-aided deep learning has significantly advanced road crack segmentation.
However, supervised models face challenges due to limited annotated images. There is also …

Novel Pipeline Integrating Cross-Modality and Motion Model for Nearshore Multi-Object Tracking in Optical Video Surveillance

J Ding, W Li, L Pei, M Yang, A Tian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Nearshore multi-object tracking (NMOT) aims to locat and identify nearshore objects. Most
approaches accomplish this task using radar and remote-sensing technologies. In contrast …

[PDF][PDF] ViT 和注意力融合的类别不均衡PCB 缺陷检测方法

陈俊英, 李朝阳, 席月芸, 刘冲 - 仪器仪表学报, 2024 - yqyb.cnjournals.com
针对实际环境下印刷电路板(PCB) 缺陷样本难以收集造成的数据长尾分布和检测精度低以及ViT
用于检测时计算复杂度高等问题, 提出多尺度ViT 特征提取和注意力特征融合的端到端PCB …

Rtsds: a real-time and efficient method for detecting surface defects in strip steel

Q Zeng, D Wei, M Zou - Journal of Real-Time Image Processing, 2024 - Springer
To address the issues of varying defect sizes, inconsistent data quality, and real-time
detection challenges in steel defect detection, we propose a real-time efficient steel defect …

[HTML][HTML] A Lightweight Cross-Layer Smoke-Aware Network

J Wang, X Zhang, C Zhang - Sensors (Basel, Switzerland), 2024 - ncbi.nlm.nih.gov
Smoke is an obvious sign of pre-fire. However, due to its variable morphology, the existing
schemes are difficult to extract precise smoke characteristics, which seriously affects the …

[引用][C] 并行特征提取和渐进特征融合的计算机主板装配缺陷检测

陈俊英, 李朝阳, 黄汉涛, 董戌泽 - Optics and Precision …, 2024 - 光学精密工程编辑部