Sparse r-cnn: End-to-end object detection with learnable proposals

C Xu, J Wang, W Yang, H Yu, L Yu, GS Xia - European conference on …, 2022 - Springer

Detecting tiny objects is one of the main obstacles hindering the development of object
detection. The performance of generic object detectors tends to drastically deteriorate on tiny …

Cited by 100 · Related articles · All 4 versions

[PDF] arxiv.org

Fairmot: On the fairness of detection and re-identification in multiple object tracking

Y Zhang, C Wang, X Wang, W Zeng, W Liu - International journal of …, 2021 - Springer

Multi-object tracking (MOT) is an important problem in computer vision which has a wide
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …

Cited by 1236 · Related articles · All 8 versions

[PDF] arxiv.org

Cyclemlp: A mlp-like architecture for dense prediction

S Chen, E Xie, C Ge, R Chen, D Liang… - arXiv preprint arXiv …, 2021 - arxiv.org

This paper presents a simple MLP-like architecture, CycleMLP, which is a versatile
backbone for visual recognition and dense predictions. As compared to modern MLP …

Cited by 241 · Related articles · All 5 versions

[PDF] neurips.cc

Codet: Co-occurrence guided region-word alignment for open-vocabulary object detection

C Ma, Y Jiang, X Wen, Z Yuan… - Advances in neural …, 2024 - proceedings.neurips.cc

Deriving reliable region-word alignment from image-text pairs is critical to learn object-level
vision-language representations for open-vocabulary object detection. Existing methods …

Cited by 22 · Related articles · All 6 versions

[PDF] arxiv.org

As-mlp: An axial shifted mlp architecture for vision

D Lian, Z Yu, X Sun, S Gao - arXiv preprint arXiv:2107.08391, 2021 - arxiv.org

An Axial Shifted MLP architecture (AS-MLP) is proposed in this paper. Different from MLP-
Mixer, where the global spatial feature is encoded for information flow through matrix …

Cited by 209 · Related articles · All 3 versions

[HTML] cjig.cn

[HTML] A survey of deep learning-based visual object detection techniques (基于深度学习的视觉目标检测技术综述)

曹家乐，李亚利，孙汉卿，谢今，黄凯奇，庞彦伟 - 2022 - cjig.cn

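Abstract: Visual object detection aims to localize and recognize the objects present in an image.
It is one of the classic tasks in computer vision and a prerequisite and foundation for many other
computer vision tasks, with important applications in areas such as autonomous driving and video surveillance …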

Cited by 26 · Related articles · All 4 versions

[PDF] thecvf.com

Sparse instance activation for real-time instance segmentation

T Cheng, X Wang, S Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we propose a conceptually novel, efficient, and fully convolutional framework
for real-time instance segmentation. Previously, most instance segmentation methods …

Cited by 109 · Related articles · All 5 versions

[PDF] thecvf.com

MonoDETR: Depth-guided transformer for monocular 3D object detection

R Zhang, H Qiu, T Wang, Z Guo, Z Cui… - Proceedings of the …, 2023 - openaccess.thecvf.com

Monocular 3D object detection has long been a challenging task in autonomous driving.
Most existing methods follow conventional 2D detectors to first localize object centers, and …

Cited by 106 · Related articles · All 6 versions

[PDF] thecvf.com

Adamixer: A fast-converging query-based object detector

Z Gao, L Wang, B Han, S Guo - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Traditional object detectors employ the dense paradigm of scanning over locations and
scales in an image. The recent query-based object detectors break this convention by …

Cited by 105 · Related articles · All 6 versions

[PDF] thecvf.com

Language as queries for referring video object segmentation

J Wu, Y Jiang, P Sun, Z Yuan… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …

Cited by 118 · Related articles · All 7 versions
