Multi-granularity prediction for scene text recognition

P Wang, C Da, C Yao - European Conference on Computer Vision, 2022 - Springer
Scene text recognition (STR) has been an active research topic in computer vision for years.
To tackle this challenging problem, numerous innovative methods have been successively …

Robust scene text recognition with automatic rectification

B Shi, X Wang, P Lyu, C Yao, X Bai - Proceedings of the IEEE …, 2016 - cv-foundation.org
Recognizing text in natural images is a challenging task with many unsolved problems.
Different from those in documents, words in natural images often possess irregular shapes …

SVTR: Scene text recognition with a single visual model

Y Du, Z Chen, C Jia, X Yin, T Zheng, C Li, Y Du… - arXiv preprint arXiv …, 2022 - arxiv.org
Dominant scene text recognition models commonly contain two building blocks, a visual
model for feature extraction and a sequence model for text transcription. This hybrid …

Synthetically supervised feature learning for scene text recognition

Y Liu, Z Wang, H Jin, I Wassell - Proceedings of the …, 2018 - openaccess.thecvf.com
We address the problem of image feature learning for scene text recognition. The image
features in the state-of-the-art methods are learned from large-scale synthetic image …

What is wrong with scene text recognition model comparisons? Dataset and model analysis

J Baek, G Kim, J Lee, S Park, D Han… - Proceedings of the …, 2019 - openaccess.thecvf.com
Many new proposals for scene text recognition (STR) models have been introduced in
recent years. While each claim to have pushed the boundary of the technology, a holistic …

Symmetrical linguistic feature distillation with CLIP for scene text recognition

Z Wang, H Xie, Y Wang, J Xu, B Zhang… - Proceedings of the 31st …, 2023 - dl.acm.org
In this paper, we explore the potential of the Contrastive Language-Image Pretraining (CLIP)
model in scene text recognition (STR), and establish a novel Symmetrical Linguistic Feature …

PETR: Rethinking the capability of transformer-based language model in scene text recognition

Y Wang, H Xie, S Fang, M Xing, J Wang… - … on Image Processing, 2022 - ieeexplore.ieee.org
The exploration of linguistic information promotes the development of scene text recognition
task. Benefiting from the significance in parallel reasoning and global relationship capture …

Attention and language ensemble for scene text recognition with convolutional sequence modeling

S Fang, H Xie, ZJ Zha, N Sun, J Tan… - Proceedings of the 26th …, 2018 - dl.acm.org
Recent dominant approaches for scene text recognition are mainly based on convolutional
neural network (CNN) and recurrent neural network (RNN), where the CNN processes …

Revisiting scene text recognition: A data perspective

Q Jiang, J Wang, D Peng, C Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper aims to re-assess scene text recognition (STR) from a data-oriented perspective.
We begin by revisiting the six commonly used benchmarks in STR and observe a trend of …

Visual semantics allow for textual reasoning better in scene text recognition

Y He, C Chen, J Zhang, J Liu, F He, C Wang… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Existing Scene Text Recognition (STR) methods typically use a language model to
optimize the joint probability of the 1D character sequence predicted by a visual recognition …