Authors
Shu Tian, Xu-Cheng Yin, Ya Su, Hong-Wei Hao
Publication date
2018/3/1
Journal
IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume
40
Issue
3
Pages
542-554
Publisher
IEEE
Description
Video text extraction plays an important role in multimedia understanding and retrieval. Most previous research efforts are conducted within individual frames. A few recent methods pay attention to text tracking across multiple frames but do not effectively mine the relations among text detection, tracking, and recognition. In this paper, we propose a generic Bayesian-based framework of Tracking based Text Detection And Recognition (T2DAR) for embedded captions in web videos, which is composed of three major components: text tracking, tracking based text detection, and tracking based text recognition. In this unified framework, text tracking is first conducted by tracking-by-detection. Tracking trajectories are then revised and refined with detection or recognition results. Text detection or recognition is finally improved with multi-frame integration. Moreover, a challenging video text (embedded …
Total citations
[Citations-per-year chart, 2017 to 2024]
Scholar articles
S Tian, XC Yin, Y Su, HW Hao - IEEE transactions on pattern analysis and machine …, 2017