Unseen object instance segmentation with fully test-time rgb-d embeddings adaptation

L Zhang, S Zhang, X Yang, H Qiao… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Segmenting unseen objects is a crucial ability for the robot since it may encounter new
environments during the operation. Recently, a popular solution is leveraging RGB-D …

Illumination-aware window transformer for RGBT modality fusion

L Zhou, Z Chen - Journal of Visual Communication and Image …, 2023 - Elsevier
Combination of RGB and thermal sensors has been proven to be useful for many vision
applications. However, how to effectively fuse the information of two modalities remains a …

Efficient detection of multilingual hate speech by using interactive attention network with minimal human feedback

F Vitiugin, Y Senarath, H Purohit - Proceedings of the 13th ACM Web …, 2021 - dl.acm.org
Online hate speech on social media has become a critical problem for social network
services that has been further fueled by the self-isolation in the COVID-2019 pandemic …

[HTML][HTML] Multi-Path interactive network for aircraft identification with optical and SAR images

Q Gao, Z Feng, S Yang, Z Chang, R Wang - Remote Sensing, 2022 - mdpi.com
Aircraft identification has been a research hotspot in remote-sensing fields. However, due to
the presence of clouds in satellite-borne optical imagery, it is difficult to identify aircraft using …

HAFNet: Hierarchical attentive fusion network for multispectral pedestrian detection

P Peng, T Xu, B Huang, J Li - Remote Sensing, 2023 - mdpi.com
Multispectral pedestrian detection via visible and thermal image pairs has received
widespread attention in recent years. It provides a promising multi-modality solution to …

Vision Fourier transformer empowered multi-modal imaging system for ethane leakage detection

J Bin, S Rogers, Z Liu - Information Fusion, 2024 - Elsevier
A leak detection is an essential procedure to guarantee reliable functioning during ethane
production and transportation with infrared imaging. However, infrared imaging cannot …

Multispectral interaction convolutional neural network for pedestrian detection

J Ryu, J Kim, H Kim, S Kim - Computer Vision and Image Understanding, 2022 - Elsevier
Fusion of multispectral data in object detection is inevitable in order to cover various
environments. However, there is still insufficient research on how to fuse information …

TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection

X Zhang, XH Zhang, J Ying, Z Sheng, H Yu, C Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Pedestrian detection plays a critical role in computer vision as it contributes to ensuring
traffic safety. Existing methods that rely solely on RGB images suffer from performance …

[PDF][PDF] 红外—可见光跨模态的行人检测综述

别倩, 王晓, 徐新, 赵启军, 王正, 陈军, 胡瑞敏 - 中国图象图形学报, 2023 - cjig.cn
可见光图像在光照充足的条件下可以提供一系列辅助检测行人的信息, 如颜色和纹理等信息,
但在低照度场景下表现并不理想. 红外图像虽然不能提供颜色和纹理信息 …

Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU

N Wanchaitanawong, M Tanaka… - Journal of …, 2023 - spiedigitallibrary.org
Multi-modal pedestrian detection, which integrates visible and thermal sensors, has been
developed to overcome many limitations of visible-modal pedestrian detection, such as poor …