Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H Xie, Y Wang, Z Mao… - Proceedings of the …, 2021 - openaccess.thecvf.com
Linguistic knowledge is of great benefit to scene text recognition. However, how to effectively
model linguistic rules in end-to-end deep networks remains a research challenge. In this …

From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H Xie, S Fang, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …

Towards accurate scene text recognition with semantic reasoning networks

D Yu, X Li, C Zhang, T Liu, J Han… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text image contains two levels of contents: visual texture and semantic information.
Although the previous scene text recognition methods have made great progress over the …

RobustScanner: Dynamically enhancing positional clues for robust text recognition

X Yue, Z Kuang, C Lin, H Sun, W Zhang - European Conference on …, 2020 - Springer
The attention-based encoder-decoder framework has recently achieved impressive results
for scene text recognition, and many variants have emerged with improvements in …

ABINet++: Autonomous, bidirectional and iterative language modeling for scene text spotting

S Fang, Z Mao, H Xie, Y Wang, C Yan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …

Primitive representation learning for scene text recognition

R Yan, L Peng, S Xiao, G Yao - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Scene text recognition is a challenging task due to diverse variations of text instances in
natural scene images. Conventional methods based on CNN-RNN-CTC or encoder …

On recognizing texts of arbitrary shapes with 2D self-attention

J Lee, S Park, J Baek, SJ Oh… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text recognition (STR) is the task of recognizing character sequences in natural
scenes. While there have been great advances in STR methods, current methods which …

Context-based contrastive learning for scene text recognition

X Zhang, B Zhu, X Yao, Q Sun, R Li, B Yu - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Pursuing accurate and robust recognizers has been a long-lasting goal for scene text
recognition (STR) researchers. Recently, attention-based methods have demonstrated their …

CDistNet: Perceiving multi-domain character distance for robust text recognition

T Zheng, Z Chen, S Fang, H Xie, YG Jiang - International Journal of …, 2024 - Springer
The transformer-based encoder-decoder framework is becoming popular in scene text
recognition, largely because it naturally integrates recognition clues from both visual and …

MaskOCR: Text recognition with masked encoder-decoder pretraining

P Lyu, C Zhang, S Liu, M Qiao, Y Xu, L Wu… - arXiv preprint arXiv …, 2022 - arxiv.org
Text images contain both visual and linguistic information. However, existing pre-training
techniques for text recognition mainly focus on either visual representation learning or …