RFLA: Gaussian receptive field based label assignment for tiny object detection

C Xu, J Wang, W Yang, H Yu, L Yu, GS Xia - European conference on …, 2022 - Springer
Detecting tiny objects is one of the main obstacles hindering the development of object
detection. The performance of generic object detectors tends to drastically deteriorate on tiny …

FairMOT: On the fairness of detection and re-identification in multiple object tracking

Y Zhang, C Wang, X Wang, W Zeng, W Liu - International journal of …, 2021 - Springer
Multi-object tracking (MOT) is an important problem in computer vision which has a wide
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …

CycleMLP: An MLP-like architecture for dense prediction

S Chen, E Xie, C Ge, R Chen, D Liang… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper presents a simple MLP-like architecture, CycleMLP, which is a versatile
backbone for visual recognition and dense predictions. As compared to modern MLP …

CoDet: Co-occurrence guided region-word alignment for open-vocabulary object detection

C Ma, Y Jiang, X Wen, Z Yuan… - Advances in neural …, 2024 - proceedings.neurips.cc
Deriving reliable region-word alignment from image-text pairs is critical to learn object-level
vision-language representations for open-vocabulary object detection. Existing methods …

AS-MLP: An axial shifted MLP architecture for vision

D Lian, Z Yu, X Sun, S Gao - arXiv preprint arXiv:2107.08391, 2021 - arxiv.org
An Axial Shifted MLP architecture (AS-MLP) is proposed in this paper. Different from MLP-
Mixer, where the global spatial feature is encoded for information flow through matrix …

A survey of deep learning-based visual object detection

曹家乐, 李亚利, 孙汉卿, 谢今, 黄凯奇, 庞彦伟 - 2022 - cjig.cn
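Abstract: Visual object detection aims to locate and recognize the objects present in an image. It is one of the classic tasks in computer vision
and a prerequisite and foundation for many other computer vision tasks, with important applications in fields such as autonomous driving and video surveillance …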

Sparse instance activation for real-time instance segmentation

T Cheng, X Wang, S Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a conceptually novel, efficient, and fully convolutional framework
for real-time instance segmentation. Previously, most instance segmentation methods …

MonoDETR: Depth-guided transformer for monocular 3D object detection

R Zhang, H Qiu, T Wang, Z Guo, Z Cui… - Proceedings of the …, 2023 - openaccess.thecvf.com
Monocular 3D object detection has long been a challenging task in autonomous driving.
Most existing methods follow conventional 2D detectors to first localize object centers, and …

AdaMixer: A fast-converging query-based object detector

Z Gao, L Wang, B Han, S Guo - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Traditional object detectors employ the dense paradigm of scanning over locations and
scales in an image. The recent query-based object detectors break this convention by …

Language as queries for referring video object segmentation

J Wu, Y Jiang, P Sun, Z Yuan… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …