From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H Xie, S Fang, J Wang… - … on Computer Vision, 2021 - openaccess.thecvf.com
vision model with language capability. Specially, we introduce the text recognition of character
Such operation guides the vision model to use not only the visual texture of characters, but …

Vision transformer for fast and efficient scene text recognition

R Atienza - … conference on document analysis and recognition, 2021 - Springer
Scene text recognition (STR) enables computers to read text in natural scenes such as object
labels, road signs and instructions. STR helps machines perform informed decisions such …

Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H Xie, Y Wang, Z Mao… - … and pattern recognition, 2021 - openaccess.thecvf.com
… ; and 3) language model with noise … scene text recognition. Firstly, the autonomous suggests
to block gradient flow between vision and language models to enforce explicitly language

Svtr: Scene text recognition with a single visual model

Y Du, Z Chen, C Jia, X Yin, T Zheng, C Li, Y Du… - arXiv preprint arXiv …, 2022 - arxiv.org
… model for feature extraction and a sequence model for text … a Single Visual model for Scene
Text recognition within the … : A new scene text recognizer with visual language modeling …

Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
… For example, instances of scene text can be in different languages, colors, fonts, sizes, … that
scene text detection can be taxonomically subsumed under general object detection, which is …

Behind the scene: Revealing the secrets of pre-trained vision-and-language models

J Cao, Z Gan, Y Cheng, L Yu, YC Chen… - … Vision–ECCV 2020: 16th …, 2020 - Springer
… -trained models have revolutionized vision-and-language (V+L) … behind the scene, we present
Value (Vision-And-Language … , Visual Coreference Resolution, Visual Relation Detection) …

Dictionary-guided scene text recognition

N Nguyen, T Nguyen, V Tran, MT Tran… - … Recognition, 2021 - openaccess.thecvf.com
language prior is a potential approach to advance scene text … Moreover, many languages
have special symbols that have … of the current scene text recognition pipeline by introducing a …

Scene text recognition with permuted autoregressive sequence models

D Bautista, R Atienza - European conference on computer vision, 2022 - Springer
… iterative language modeling for scene text recognition. In: Proceedings of the IEEE/CVF
Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7098–7107, June 2021 …

Review of scene text detection and recognition

H Lin, P Yang, F Zhang - Archives of computational methods in …, 2020 - Springer
… , scene text detection is a challenging problem. Similar to majority of computer vision tasks,
most previous text detection … With text recognition, techniques related to language model and …

VOLTER: Visual Collaboration and Dual-Stream Fusion for Scene Text Recognition

JN Li, XQ Liu, X Luo, XS Xu - IEEE Transactions on Multimedia, 2024 - ieeexplore.ieee.org
… and LM, we introduce a VisionLanguage Contrastive (VLC) module by encouraging positive
… of scene text recognition task. In this method, we propose a novel and powerful visual