Transflow: Transformer as flow learner

Y Lu, Q Wang, S Ma, T Geng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …

Feature aggregated queries for transformer-based video object detectors

Y Cui - Proceedings of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Video object detection needs to solve feature degradation situations that rarely happen in
the image domain. One solution is to use the temporal information and fuse the features from …

RingMo-sense: Remote sensing foundation model for spatiotemporal prediction via spatiotemporal evolution disentangling

F Yao, W Lu, H Yang, L Xu, C Liu, L Hu… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
Remote sensing (RS) spatiotemporal prediction aims to infer future trends from historical
spatiotemporal data, eg, videos and time-series images, which has a broad application …

Transformers in small object detection: A benchmark and survey of state-of-the-art

AM Rekavandi, S Rashidi, F Boussaid, S Hoefs… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have rapidly gained popularity in computer vision, especially in the field of
object recognition and detection. Upon examining the outcomes of state-of-the-art object …

基于Transformer 的目标检测算法综述.

李建, 杜建强, 朱彦陈, 郭永坤 - Journal of Computer …, 2023 - search.ebscohost.com
深度学习框架Transformer 具有强大的建模能力和并行计算能力, 目前基于Transformer
的目标检测算法已经成为研究的热点. 为了进一步探索目标检测的新思路, 新方向 …

Longshortnet: Exploring temporal and semantic features fusion in streaming perception

C Li, ZQ Cheng, JY He, P Li, B Luo… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Streaming perception is a fundamental task in autonomous driving that requires a careful
balance between the latency and accuracy of the autopilot system. However, current …

Faq: Feature aggregated queries for transformer-based video object detectors

Y Cui, L Yang - arXiv preprint arXiv:2303.08319, 2023 - arxiv.org
Video object detection needs to solve feature degradation situations that rarely happen in
the image domain. One solution is to use the temporal information and fuse the features from …

[HTML][HTML] S-DETR: A Transformer model for real-time detection of marine ships

Z Xing, J Ren, X Fan, Y Zhang - Journal of Marine Science and …, 2023 - mdpi.com
Due to the ever-changing shape and scale of ships, as well as the complex sea background,
accurately detecting multi-scale ships on the sea while considering real-time requirements …

Objects do not disappear: Video object detection by single-frame object location anticipation

X Liu, FK Nejadasl, JC van Gemert… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Objects in videos are typically characterized by continuous smooth motion. We
exploit continuous smooth motion in three ways. 1) Improved accuracy by using object …

Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions

X Zhang, CH Chou - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
When deploying pre-trained video object detectors in real-world scenarios the domain gap
between training and testing data caused by adverse image conditions often leads to …