Sparse R-CNN: End-to-end object detection with learnable proposals

Y Gu, J Chi, J Liu, L Yang, B Zhang, D Yu… - Computers in biology …, 2021 - Elsevier

Lung cancer has one of the highest mortalities of all cancers. According to the National Lung
Screening Trial, patients who underwent low-dose computed tomography (CT) scanning …

Cited by 105 · Related articles · All 4 versions

Dynamic head: Unifying object detection heads with attentions

X Dai, Y Chen, B Xiao, D Chen, M Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com

The complex nature of combining localization and classification in object detection has
resulted in the flourished development of methods. Previous works tried to improve the …

Cited by 487 · Related articles · All 6 versions

Swin Transformer: Hierarchical vision transformer using shifted windows

Z Liu, Y Lin, Y Cao, H Hu, Y Wei… - Proceedings of the …, 2021 - openaccess.thecvf.com

This paper presents a new vision Transformer, called Swin Transformer, that capably serves
as a general-purpose backbone for computer vision. Challenges in adapting Transformer …

Cited by 18548 · Related articles · All 12 versions

Pyramid vision transformer: A versatile backbone for dense prediction without convolutions

W Wang, E Xie, X Li, DP Fan, K Song… - Proceedings of the …, 2021 - openaccess.thecvf.com

Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …

Cited by 3510 · Related articles · All 10 versions

Focal self-attention for local-global interactions in vision transformers

J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan… - arXiv preprint arXiv …, 2021 - arxiv.org

Recently, Vision Transformer and its variants have shown great promise on various
computer vision tasks. The ability of capturing short- and long-range visual dependencies …

Cited by 409 · Related articles · All 2 versions

Dynamic DETR: End-to-end object detection with dynamic attention

X Dai, Y Chen, J Yang, P Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com

In this paper, we present a novel Dynamic DETR (Detection with Transformers) approach by
introducing dynamic attentions into both the encoder and decoder stages of DETR to break …

Cited by 244 · Related articles · All 5 versions

DetCo: Unsupervised contrastive learning for object detection

E Xie, J Ding, W Wang, X Zhan, H Xu… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present DetCo, a simple yet effective self-supervised approach for object detection.
Unsupervised pre-training methods have been recently designed for object detection, but …

Cited by 343 · Related articles · All 9 versions

Instances as queries

Y Fang, S Yang, X Wang, Y Li, C Fang… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present QueryInst, a new perspective for instance segmentation. QueryInst is a
multi-stage end-to-end system that treats instances of interest as learnable queries, enabling …

Cited by 255 · Related articles · All 8 versions

FairMOT: On the fairness of detection and re-identification in multiple object tracking

Y Zhang, C Wang, X Wang, W Zeng, W Liu - International Journal of …, 2021 - Springer

Multi-object tracking (MOT) is an important problem in computer vision which has a wide
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …

Cited by 1183 · Related articles · All 10 versions

A simple single-scale vision transformer for object localization and instance segmentation

W Chen, X Du, F Yang, L Beyer, X Zhai, TY Lin… - arXiv preprint arXiv …, 2021 - arxiv.org

This work presents a simple vision transformer design as a strong baseline for object
localization and instance segmentation tasks. Transformers recently demonstrate …

Cited by 165 · Related articles · All 5 versions
