Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

Sam-adapter: Adapting segment anything in underperformed scenes

T Chen, L Zhu, C Deng, R Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com
The emergence of large models, also known as foundation models, has brought significant
advancements to AI research. One such model is Segment Anything (SAM), which is …

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Deep hierarchical semantic segmentation

L Li, T Zhou, W Wang, J Li… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world in multiple levels. However …

Exploring cross-image pixel contrast for semantic segmentation

W Wang, T Zhou, F Yu, J Dai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Current semantic segmentation methods focus only on mining" local" context, ie,
dependencies between pixels within individual images, by context-aggregation modules …

SAM Fails to Segment Anything?--SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More

T Chen, L Zhu, C Ding, R Cao, Y Wang, Z Li… - arXiv preprint arXiv …, 2023 - arxiv.org
The emergence of large models, also known as foundation models, has brought significant
advancements to AI research. One such model is Segment Anything (SAM), which is …

Adaptive prototype learning and allocation for few-shot segmentation

G Li, V Jampani, L Sevilla-Lara… - Proceedings of the …, 2021 - openaccess.thecvf.com
Prototype learning is extensively used for few-shot segmentation. Typically, a single
prototype is obtained from the support feature by averaging the global object information …

Self-support few-shot semantic segmentation

Q Fan, W Pei, YW Tai, CK Tang - European Conference on Computer …, 2022 - Springer
Existing few-shot segmentation methods have achieved great progress based on the
support-query matching framework. But they still heavily suffer from the limited coverage of …

Restr: Convolution-free referring image segmentation using transformers

N Kim, D Kim, C Lan, W Zeng… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Referring image segmentation is an advanced semantic segmentation task where target is
not a predefined class but is described in natural language. Most of existing methods for this …

Monodtr: Monocular 3d object detection with depth-aware transformer

KC Huang, TH Wu, HT Su… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Monocular 3D object detection is an important yet challenging task in autonomous driving.
Some existing methods leverage depth information from an off-the-shelf depth estimator to …