Cross-modal orthogonal high-rank augmentation for rgb-event transformer-trackers

Z Zhu, J Hou, DO Wu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
This paper addresses the problem of cross-modal object tracking from RGB videos and
event data. Rather than constructing a complex cross-modal fusion network, we explore the …

Visevent: Reliable object tracking via collaboration of frame and event flows

X Wang, J Li, L Zhu, Z Zhang, Z Chen… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Different from visible cameras which record intensity images frame by frame, the biologically
inspired event camera produces a stream of asynchronous and sparse events with much …

Learning spatial-frequency transformer for visual object tracking

C Tang, X Wang, Y Bai, Z Wu, J Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Recently, some researchers have begun to adopt the Transformer to combine or replace the
widely used ResNet as their new backbone network. As the Transformer captures the long …

Cross-modality hierarchical clustering and refinement for unsupervised visible-infrared person re-identification

Z Pang, C Wang, L Zhao, Y Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Visible-infrared person re-identification (VI-ReID) is a challenging cross-modality image
retrieval task. Compared to visible modality person re-identification that handles only the …

Real-time multi-task facial analytics with event cameras

C Ryan, A Elrasad, W Shariff, J Lemley, P Kielty… - IEEE …, 2023 - ieeexplore.ieee.org
Event cameras, unlike traditional frame-based cameras, excel in detecting and reporting
changes in light intensity on a per-pixel basis. This unique technology offers numerous …

Ecsnet: Spatio-temporal feature learning for event camera

Z Chen, J Wu, J Hou, L Li, W Dong… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The neuromorphic event cameras can efficiently sense the latent geometric structures and
motion clues of a scene by generating asynchronous and sparse event signals. Due to the …

Siamthn: Siamese target highlight network for visual tracking

J Bao, K Chen, X Sun, L Zhao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Siamese network based trackers develop rapidly in the field of visual object tracking in
recent years. The majority of Siamese network based trackers now in use treat each channel …

Unsupervised deep event stereo for depth estimation

SMN Uddin, SH Ahmed, YJ Jung - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Bio-inspired event cameras have been considered effective alternatives to traditional frame-
based cameras for stereo depth estimation, especially in challenging conditions such as low …

Switch and refine: A long-term tracking and segmentation framework

X Xu, J Zhao, J Wu, F Shen - … on Circuits and Systems for Video …, 2022 - ieeexplore.ieee.org
In long-term video object tracking (VOT) tasks, most long-term trackers are modified from
short-term trackers, which contain more and more machine learning modules to improve …

Joint spatio-temporal similarity and discrimination learning for visual tracking

Y Liang, H Chen, Q Wu, C Xia… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Visual tracking is a task of localizing a target unceasingly in a video with an initial target
state at the first frame. The limited target information makes this problem an extremely …