Vision transformer for fast and efficient scene text recognition

R Atienza - International conference on document analysis and …, 2021 - Springer
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …

Multi-granularity prediction for scene text recognition

P Wang, C Da, C Yao - European Conference on Computer Vision, 2022 - Springer
Scene text recognition (STR) has been an active research topic in computer vision for years.
To tackle this challenging problem, numerous innovative methods have been successively …

What is wrong with scene text recognition model comparisons? dataset and model analysis

J Baek, G Kim, J Lee, S Park, D Han… - Proceedings of the …, 2019 - openaccess.thecvf.com
Many new proposals for scene text recognition (STR) models have been introduced in
recent years. While each claim to have pushed the boundary of the technology, a holistic …

Convolutional attention networks for scene text recognition

H Xie, S Fang, ZJ Zha, Y Yang, Y Li… - ACM Transactions on …, 2019 - dl.acm.org
In this article, we present Convoluitional Attention Networks (CAN) for unconstrained scene
text recognition. Recent dominant approaches for scene text recognition are mainly based …

Pimnet: a parallel, iterative and mimicking network for scene text recognition

Z Qiao, Y Zhou, J Wei, W Wang, Y Zhang… - Proceedings of the 29th …, 2021 - dl.acm.org
Nowadays, scene text recognition has attracted more and more attention due to its various
applications. Most state-of-the-art methods adopt an encoder-decoder framework with …

Svtr: Scene text recognition with a single visual model

Y Du, Z Chen, C Jia, X Yin, T Zheng, C Li, Y Du… - arXiv preprint arXiv …, 2022 - arxiv.org
Dominant scene text recognition models commonly contain two building blocks, a visual
model for feature extraction and a sequence model for text transcription. This hybrid …

Symmetrical linguistic feature distillation with clip for scene text recognition

Z Wang, H Xie, Y Wang, J Xu, B Zhang… - Proceedings of the 31st …, 2023 - dl.acm.org
In this paper, we explore the potential of the Contrastive Language-Image Pretraining (CLIP)
model in scene text recognition (STR), and establish a novel Symmetrical Linguistic Feature …

Attention and language ensemble for scene text recognition with convolutional sequence modeling

S Fang, H Xie, ZJ Zha, N Sun, J Tan… - Proceedings of the 26th …, 2018 - dl.acm.org
Recent dominant approaches for scene text recognition are mainly based on convolutional
neural network (CNN) and recurrent neural network (RNN), where the CNN processes …

Textscanner: Reading characters in order for robust scene text recognition

Z Wan, M He, H Chen, X Bai, C Yao - … of the AAAI conference on artificial …, 2020 - aaai.org
Driven by deep learning and a large volume of data, scene text recognition has evolved
rapidly in recent years. Formerly, RNN-attention-based methods have dominated this field …

Revisiting scene text recognition: A data perspective

Q Jiang, J Wang, D Peng, C Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper aims to re-assess scene text recognition (STR) from a data-oriented perspective.
We begin by revisiting the six commonly used benchmarks in STR and observe a trend of …