From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H Xie, S Fang, J Wang… - … on Computer Vision, 2021 - openaccess.thecvf.com
vision model with language capability. Specially, we introduce the text recognition of character
Such operation guides the vision model to use not only the visual texture of characters, but …

Vision transformer for fast and efficient scene text recognition

R Atienza - … conference on document analysis and recognition, 2021 - Springer
Scene text recognition (STR) enables computers to read text in natural scenes such as object
labels, road signs and instructions. STR helps machines perform informed decisions such …

Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H Xie, Y Wang, Z Mao… - … and pattern recognition, 2021 - openaccess.thecvf.com
… ; and 3) language model with noise … scene text recognition. Firstly, the autonomous suggests
to block gradient flow between vision and language models to enforce explicitly language

Multi-granularity prediction for scene text recognition

P Wang, C Da, C Yao - European Conference on Computer Vision, 2022 - Springer
language information of text. In order to effectively resort to linguistic information for scene text
recognition… in NLP [7] into text recognition method. Subword tokenization algorithms aim to …

Svtr: Scene text recognition with a single visual model

Y Du, Z Chen, C Jia, X Yin, T Zheng, C Li, Y Du… - arXiv preprint arXiv …, 2022 - arxiv.org
… model for feature extraction and a sequence model for text … a Single Visual model for Scene
Text recognition within the … : A new scene text recognizer with visual language modeling …

Behind the scene: Revealing the secrets of pre-trained vision-and-language models

J Cao, Z Gan, Y Cheng, L Yu, YC Chen… - … Vision–ECCV 2020: 16th …, 2020 - Springer
… -trained models have revolutionized vision-and-language (V+L) … behind the scene, we present
Value (Vision-And-Language … , Visual Coreference Resolution, Visual Relation Detection) …

Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
… For example, instances of scene text can be in different languages, colors, fonts, sizes, … that
scene text detection can be taxonomically subsumed under general object detection, which is …

Dictionary-guided scene text recognition

N Nguyen, T Nguyen, V Tran, MT Tran… - … Recognition, 2021 - openaccess.thecvf.com
language prior is a potential approach to advance scene text … Moreover, many languages
have special symbols that have … of the current scene text recognition pipeline by introducing a …

Scene text recognition with permuted autoregressive sequence models

D Bautista, R Atienza - European conference on computer vision, 2022 - Springer
… iterative language modeling for scene text recognition. In: Proceedings of the IEEE/CVF
Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7098–7107, June 2021 …

Visual semantics allow for textual reasoning better in scene text recognition

Y He, C Chen, J Zhang, J Liu, F He, C Wang… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
… a graph-based context reasoning model that supplements the language model to exploit
both visual spatial context and linguistic context to improve the visual recognition results. …