Repvit: Revisiting mobile cnn from vit perspective

A Wang, H Chen, Z Lin, J Han… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Recently lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …

UAV-YOLOv8: A small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios

G Wang, Y Chen, P An, H Hong, J Hu, T Huang - Sensors, 2023 - mdpi.com
Unmanned aerial vehicle (UAV) object detection plays a crucial role in civil, commercial, and
military domains. However, the high proportion of small objects in UAV images and the …

BL-YOLOv8: An improved road defect detection model based on YOLOv8

X Wang, H Gao, Z Jia, Z Li - Sensors, 2023 - mdpi.com
Road defect detection is a crucial task for promptly repairing road damage and ensuring
road safety. Traditional manual detection methods are inefficient and costly. To overcome …

Rmt: Retentive networks meet vision transformers

Q Fan, H Huang, M Chen, H Liu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Vision Transformer (ViT) has gained increasing attention in the computer vision
community in recent years. However the core component of ViT Self-Attention lacks explicit …

A lightweight transformer network for hyperspectral image classification

X Zhang, Y Su, L Gao, L Bruzzone… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Transformer is a powerful tool for capturing long-range dependencies and has shown
impressive performance in hyperspectral image (HSI) classification. However, such power …

Wnet: W-shaped hierarchical network for remote sensing image change detection

X Tang, T Zhang, J Ma, X Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Change detection (CD) is a hot research topic in the remote-sensing (RS) community. With
the increasing availability of high-resolution (HR) RS images, there is a growing demand for …

HiFuse: Hierarchical multi-scale feature fusion network for medical image classification

X Huo, G Sun, S Tian, Y Wang, L Yu, J Long… - … Signal Processing and …, 2024 - Elsevier
Effective fusion of global and local multi-scale features is crucial for medical image
classification. Medical images have many noisy, scattered features, intra-class variations …

Tea tree pest detection algorithm based on improved Yolov7-Tiny

Z Yang, H Feng, Y Ruan, X Weng - Agriculture, 2023 - mdpi.com
Timely and accurate identification of tea tree pests is critical for effective tea tree pest control.
We collected image data sets of eight common tea tree pests to accurately represent the true …

An improved wildfire smoke detection based on YOLOv8 and UAV images

SN Saydirasulovich, M Mukhiddinov, O Djuraev… - Sensors, 2023 - mdpi.com
Forest fires rank among the costliest and deadliest natural disasters globally. Identifying the
smoke generated by forest fires is pivotal in facilitating the prompt suppression of developing …

TransNeXt: Robust Foveal Visual Perception for Vision Transformers

D Shi - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Due to the depth degradation effect in residual connections many efficient Vision
Transformers models that rely on stacking layers for information exchange often fail to form …