Learning diverse models for end-to-end ensemble tracking

N Wang, W Zhou, H Li - IEEE Transactions on Image …, 2021 - ieeexplore.ieee.org
IEEE Transactions on Image Processing, 2021ieeexplore.ieee.org
In visual tracking, how to effectively model the target appearance using limited prior
information remains an open problem. In this paper, we leverage an ensemble of diverse
models to learn manifold representations for robust object tracking. The proposed ensemble
framework includes a shared backbone network for efficient feature extraction and multiple
head networks for independent predictions. Trained by the shared data within an identical
structure, the mutually correlated head models heavily hinder the potential of ensemble …
In visual tracking, how to effectively model the target appearance using limited prior information remains an open problem. In this paper, we leverage an ensemble of diverse models to learn manifold representations for robust object tracking. The proposed ensemble framework includes a shared backbone network for efficient feature extraction and multiple head networks for independent predictions. Trained by the shared data within an identical structure, the mutually correlated head models heavily hinder the potential of ensemble learning. To shrink the representational overlaps among multiple models while encouraging the diversity of individual predictions, we propose the model diversity and response diversity regularization terms during training. By fusing these distinctive prediction results via a fusion module, the tracking variance caused by the distractor objects can be largely restrained. Our whole framework is end-to-end trained in a data-driven manner, avoiding the heuristic designs of multiple base models and fusion strategies. The proposed method achieves state-of-the-art results on seven challenging benchmarks while operating in real-time.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果