[HTML][HTML] Attention mechanisms in computer vision: A survey

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

[HTML][HTML] Deep learning-based building extraction from remote sensing images: A comprehensive review

L Luo, P Li, X Yan - Energies, 2021 - mdpi.com
Building extraction from remote sensing (RS) images is a fundamental task for geospatial
applications, aiming to obtain morphology, location, and other information about buildings …

Lisa: Reasoning segmentation via large language model

X Lai, Z Tian, Y Chen, Y Li, Y Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Although perception systems have made remarkable advancements in recent years they still
rely on explicit human instruction or pre-defined categories to identify the target objects …

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery

L Wang, R Li, C Zhang, S Fang, C Duan, X Meng… - ISPRS Journal of …, 2022 - Elsevier
Semantic segmentation of remotely sensed urban scene images is required in a wide range
of practical applications, such as land cover mapping, urban change detection …

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

Segmenter: Transformer for semantic segmentation

R Strudel, R Garcia, I Laptev… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …

UTNet: a hybrid transformer architecture for medical image segmentation

Y Gao, M Zhou, DN Metaxas - … , France, September 27–October 1, 2021 …, 2021 - Springer
Transformer architecture has emerged to be successful in a number of natural language
processing tasks. However, its applications to medical vision remain largely unexplored. In …

Sam-adapter: Adapting segment anything in underperformed scenes

T Chen, L Zhu, C Deng, R Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com
The emergence of large models, also known as foundation models, has brought significant
advancements to AI research. One such model is Segment Anything (SAM), which is …

Deep hierarchical semantic segmentation

L Li, T Zhou, W Wang, J Li… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world in multiple levels. However …

Sa-net: Shuffle attention for deep convolutional neural networks

QL Zhang, YB Yang - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
Attention mechanisms, which enable a neural network to accurately focus on all the relevant
elements of the input, have become an essential component to improve the performance of …