Symmetrical linguistic feature distillation with clip for scene text recognition

Z Wang, H Xie, Y Wang, J Xu, B Zhang… - Proceedings of the 31st …, 2023 - dl.acm.org
… the potential of the Contrastive LanguageImage Pretraining (CLIP) model in scene text
From two to one: A new scene text recognizer with visual language modeling network. In …

VOLTER: Visual Collaboration and Dual-Stream Fusion for Scene Text Recognition

JN Li, XQ Liu, X Luo, XS Xu - IEEE Transactions on Multimedia, 2024 - ieeexplore.ieee.org
… and LM, we introduce a VisionLanguage Contrastive (VLC) module by encouraging positive
… of scene text recognition task. In this method, we propose a novel and powerful visual

Strokelets: A learned multi-scale representation for scene text recognition

C Yao, X Bai, B Shi, W Liu - … vision and pattern recognition, 2014 - openaccess.thecvf.com
text recognition in natural scenes (aka scene text recognition) … characteristics of the
corresponding languages. For example, … could learn a hybrid set of strokelets on multiple …

IterVM: iterative vision modeling module for scene text recognition

X Chu, Y Wang - … Conference on Pattern Recognition (ICPR), 2022 - ieeexplore.ieee.org
… of extracting visual features for scene text recognition. … language modeling module (IterLM)
in ABINet [10], our IterVM can iteratively enhance the visual features for scene text recognition

Accurate scene text recognition based on recurrent neural network

B Su, S Lu - … 2014: 12th Asian Conference on Computer Vision …, 2015 - Springer
Scene text recognition is a useful but very challenging task due to uncontrolled condition of
text in natural scenes. This paper presents a novel approach to recognize text in scene

Scene text detection and recognition: Recent advances and future trends

Y Zhu, C Yao, X Bai - Frontiers of Computer Science, 2016 - Springer
… review of works on scene text detection and recognition in the … ] in the fields of scene text
detection and recognition. However, … level (character detection) and high level (language prior) …

Seed: Semantics enhanced encoder-decoder framework for scene text recognition

Z Qiao, Y Zhou, D Yang, Y Zhou… - … pattern recognition, 2020 - openaccess.thecvf.com
… the visual feature and the decoder focusing on the language … We propose SEED for scene
text recognition, which predicts … by the word embedding from a pre-trained language model. …

Scene text detection and recognition with advances in deep learning: a survey

X Liu, G Meng, C Pan - … Journal on Document Analysis and Recognition  …, 2019 - Springer
… as script identification, text/non-text classification and text-to-… directions on scene text
detection and recognition that need … Visual understanding: Vision and language is an interesting …

Context-based contrastive learning for scene text recognition

X Zhang, B Zhu, X Yao, Q Sun, R Li, B Yu - Proceedings of the AAAI …, 2022 - ojs.aaai.org
… , we also adopt a language model (LM) same as (Fang et al. 2021). Following the same
experimental setting, we first use ConCLR to pretrain a vision model (ABINet-Vision), and then …

Scene text recognition with sliding convolutional character models

F Yin, YC Wu, XY Zhang, CL Liu - arXiv preprint arXiv:1709.01727, 2017 - arxiv.org
vision is suppressed so that no new information is acquired [37]. Inspired by these, we build
our scene text recognition … in recognition, we trained a 5-gram character language model (LM…