ISTR: End-to-end instance segmentation with transformers

W Gu, S Bai, L Kong - Image and Vision Computing, 2022 - Elsevier

Image instance segmentation involves labeling pixels of images with classes and instances,
which is one of the pivotal technologies in many domains, such as natural scenes …

Cited by 141 Related articles All 2 versions

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Cited by 356 Related articles All 22 versions

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Cited by 2025 Related articles All 7 versions

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Cited by 69 Related articles All 3 versions

A survey on visual transformer

K Han, Y Wang, H Chen, X Chen, J Guo, Z Liu… - arXiv preprint arXiv …, 2020 - arxiv.org

Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Cited by 363 Related articles All 3 versions

Sparse instance activation for real-time instance segmentation

T Cheng, X Wang, S Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we propose a conceptually novel, efficient, and fully convolutional framework
for real-time instance segmentation. Previously, most instance segmentation methods …

Cited by 131 Related articles All 5 versions

P2T: Pyramid pooling transformer for scene understanding

YH Wu, Y Liu, X Zhan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Recently, the vision transformer has achieved great success by pushing the state-of-the-art
of various vision tasks. One of the most challenging problems in the vision transformer is that …

Cited by 217 Related articles All 14 versions

CLUSTSEG: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org

We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (i.e., superpixel, semantic, instance, and panoptic) through a …

Cited by 75 Related articles All 5 versions

SwinTextSpotter: Scene text spotting via better synergy between text detection and text recognition

M Huang, Y Liu, Z Peng, C Liu, D Lin… - Proceedings of the …, 2022 - openaccess.thecvf.com

End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …

Cited by 112 Related articles All 6 versions
