TransVOD: end-to-end video object detection with spatial-temporal transformers

Q Zhou, X Li, L He, Y Yang, G Cheng… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the
need for many hand-designed components in object detection while demonstrating good …

End-to-end video object detection with spatial-temporal transformers

L He, Q Zhou, X Li, L Niu, G Cheng, X Li, W Liu… - Proceedings of the 29th …, 2021 - dl.acm.org
Recently, DETR and Deformable DETR have been proposed to eliminate the need for many
hand-designed components in object detection while demonstrating good performance as …

Inspro: Propagating instance query and proposal for online video instance segmentation

F He, H Zhang, N Gao, J Jia, Y Shan… - Advances in …, 2022 - proceedings.neurips.cc
Video instance segmentation (VIS) aims at segmenting and tracking objects in videos. Prior
methods typically generate frame-level or clip-level object instances first and then associate …

Queryprop: Object query propagation for high-performance video object detection

F He, N Gao, J Jia, X Zhao, K Huang - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Video object detection has been an important yet challenging topic in computer vision.
Traditional methods mainly focus on designing the image-level or box-level feature …

Dgrnet: A dual-level graph relation network for video object detection

Q Qi, T Hou, Y Lu, Y Yan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Video object detection is a fundamental and important task in computer vision. One mainstay
solution for this task is to aggregate features from different frames to enhance the detection …

Object detection in drone video with temporal attention gated recurrent unit based on transformer

Z Zhou, X Yu, X Chen - Drones, 2023 - mdpi.com
Unmanned aerial vehicle (UAV) based object detection plays a pivotal role in civil and
military fields. Unfortunately, the problem is more challenging than general visual object …

Temporal-adaptive sparse feature aggregation for video object detection

F He, Q Li, X Zhao, K Huang - Pattern Recognition, 2022 - Elsevier
Video object detection is a challenging task due to the appearance deterioration in video
frames. To enhance feature representation of the deteriorated frames, previous methods …

Query-memory re-aggregation for weakly-supervised video object segmentation

F Lin, H Xie, Y Li, Y Zhang - Proceedings of the AAAI conference on …, 2021 - ojs.aaai.org
Weakly-supervised video object segmentation (WVOS) is an emerging video task that can
track and segment the target given a simple bounding box label. However, existing WVOS …

Bilateral temporal re-aggregation for weakly-supervised video object segmentation

F Lin, H Xie, C Liu, Y Zhang - … on Circuits and Systems for Video …, 2021 - ieeexplore.ieee.org
Weakly-supervised video object segmentation is an emerging video task to track and
segment the target given a simple bounding box label, which requires the method to fully …

Class-aware dual-supervised aggregation network for video object detection

Q Qi, Y Yan, H Wang - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Video object detection has attracted increasing attention in recent years. Although great
success has been achieved by off-the-shelf video object detection methods through …