查看文章

thecvf.com 中的 [PDF]

Tf-blender: Temporal feature blender for video object detection

作者

Yiming Cui, Liqi Yan, Zhiwen Cao, Dongfang Liu

发表日期

2021

研讨会论文

Proceedings of the IEEE/CVF international conference on computer vision

页码范围

8138-8147

简介

Video objection detection is a challenging task because isolated video frames may encounter appearance deterioration, which introduces great confusion for detection. One of the popular solutions is to exploit the temporal information and enhance per-frame representation through aggregating features from neighboring frames. Despite achieving improvements in detection, existing methods focus on the selection of higher-level video frames for aggregation rather than modeling lower-level temporal relations to increase the feature representation. To address this limitation, we propose a novel solution named TF-Blender, which includes three modules: 1) Temporal relation models the relations between the current frame and its neighboring frames to preserve spatial information. 2). Feature adjustment enriches the representation of every neighboring feature map; 3) Feature blender combines outputs from the first two modules and produces stronger features for the later detection tasks. For its simplicity, TF-Blender can be effortlessly plugged into any detection network to improve detection behavior. Extensive evaluations on ImageNet VID and YouTube-VIS benchmarks indicate the performance guarantees of using TF-Blender on recent state-of-the-art methods.

引用总数

被引用次数：138

20222023202434 65 39

学术搜索中的文章

Tf-blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF international conference …, 2021

被引用次数：138 相关文章所有 6 个版本