Text detection, tracking and recognition in video: a comprehensive survey

XC Yin, ZY Zuo, S Tian, CL Liu - IEEE Transactions on Image …, 2016 - ieeexplore.ieee.org
The intelligent analysis of video data is currently in wide demand because a video is a major
source of sensory data in our lives. Text is a prominent and direct source of information in …

A survey of zero-shot learning: Settings, methods, and applications

W Wang, VW Zheng, H Yu, C Miao - ACM Transactions on Intelligent …, 2019 - dl.acm.org
Most machine-learning methods focus on classifying instances whose classes have already
been seen in training. In practice, many applications require classifying instances whose …

Mixed high-order attention network for person re-identification

B Chen, W Deng, J Hu - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Attention has become more attractive in person re-identification (ReID) as it is capable of
biasing the allocation of available resources towards the most informative parts of an input …

Fine-grained video-text retrieval with hierarchical graph reasoning

S Chen, Y Zhao, Q Jin, Q Wu - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
Cross-modal retrieval between videos and texts has attracted growing attention due to the
rapid emergence of videos on the web. The current dominant approach is to learn a joint …

Dual encoding for video retrieval by text

J Dong, X Li, C Xu, X Yang, G Yang… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
This paper attacks the challenging problem of video retrieval by text. In such a retrieval
paradigm, an end user searches for unlabeled videos by ad-hoc queries described …

Learning deep representations of fine-grained visual descriptions

S Reed, Z Akata, H Lee… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
State-of-the-art methods for zero-shot visual recognition formulate learning as a joint
embedding problem of images and side information. In these formulations the current best …

Dual encoding for zero-example video retrieval

J Dong, X Li, C Xu, S Ji, Y He… - Proceedings of the …, 2019 - openaccess.thecvf.com
This paper attacks the challenging problem of zero-example video retrieval. In such a
retrieval paradigm, an end user searches for unlabeled videos by ad-hoc queries described …

Reading-strategy inspired visual representation learning for text-to-video retrieval

J Dong, Y Wang, X Chen, X Qu, X Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
This paper aims for the task of text-to-video retrieval, where given a query in the form of a
natural-language sentence, it is asked to retrieve videos which are semantically relevant to …

Zero-shot event detection via event-adaptive concept relevance mining

Z Li, L Yao, X Chang, K Zhan, J Sun, H Zhang - Pattern Recognition, 2019 - Elsevier
Zero-shot complex event detection has been an emerging task in coping with the scarcity of
labeled training videos in practice. Aiming to progress beyond the state-of-the-art zero-shot …

Visual semantic search: Retrieving videos via complex textual queries

D Lin, S Fidler, C Kong… - Proceedings of the IEEE …, 2014 - openaccess.thecvf.com
In this paper, we tackle the problem of retrieving videos using complex natural language
queries. Towards this goal, we first parse the sentential descriptions into a semantic graph …