Weakly Misalignment-free Adaptive Feature Alignment for UAVs-based Multimodal Object Detection

C Chen, J Qi, X Liu, K Bin, R Fu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Visible-infrared (RGB-IR) image fusion has shown great potentials in object
detection based on unmanned aerial vehicles (UAVs). However the weakly misalignment …

Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection

L Fu, W Gu, Y Ai, W Li, D Wang - Infrared Physics & Technology, 2021 - Elsevier
A pedestrian detector that uses visible and thermal infrared image pairs as the input has
better detection performance than a detector that uses only visible image under challenging …

MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection

T Kim, S Chung, D Yeom, Y Yu, HG Kim… - arXiv preprint arXiv …, 2024 - arxiv.org
Multispectral pedestrian detection is attractive for around-the-clock applications due to the
complementary information between RGB and thermal modalities. However, current models …

Removal and selection: Improving rgb-infrared object detection via coarse-to-fine fusion

T Zhao, M Yuan, X Wei - arXiv preprint arXiv:2401.10731, 2024 - arxiv.org
Object detection in visible (RGB) and infrared (IR) images has been widely applied in recent
years. Leveraging the complementary characteristics of RGB and IR images, the object …

A non-parametric softmax for improving neural attention in time-series forecasting

S Totaro, A Hussain, S Scardapane - Neurocomputing, 2020 - Elsevier
Neural attention has become a key component in many deep learning applications, ranging
from machine translation to time series forecasting. While many variations of attention have …

Beyond Fusion: Modality Hallucination-based Multispectral Fusion for Pedestrian Detection

Q Xie, TY Cheng, JX Zhong, K Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Pedestrian detection is a fundamental task for many downstream applications. Visible and
thermal images, as the two most important data types, are usually used to detect pedestrians …

Cross-Modality Proposal-guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection

C Tian, Z Zhou, Y Huang, G Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
RGB-Thermal (RGB-T) pedestrian detection aims to locate pedestrians in RGB-T image
pairs to exploit the complementation between the two modalities for improving detection …

Cross-Modal Oriented Object Detection of UAV Aerial Images Based on Image Feature

H Wang, C Wang, Q Fu, D Zhang, R Kou… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Arbitrary-oriented object detection is vital for improving unmanned aerial vehicle (UAV)
sensing and has promising applications. However, challenges persist in detecting objects …

[HTML][HTML] Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection

Y Yang, K Xu, K Wang - Frontiers in Physics, 2023 - frontiersin.org
Multispectral pedestrian detection is a technology designed to detect and locate pedestrians
in Color and Thermal images, which has been widely used in automatic driving, video …

Region-Based Illumination-Temperature Awareness and Cross-Modality Enhancement for Multispectral Pedestrian Detection

Y Liu, C Hu, B Zhao, Y Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Multispectral pedestrian detection based on RGB-thermal (RGB-T) camera has been
actively studied in autonomous driving in recent years as its robustness under complex …