CLIPTER: Looking at the bigger picture in scene text recognition

A Aberdam, D Bensaïd, A Golts… - Proceedings of the …, 2023 - openaccess.thecvf.com
Reading text in real-world scenarios often requires understanding the context surrounding it,
especially when dealing with poor-quality text. However, current scene text recognizers are …

CLIP4STR: A simple baseline for scene text recognition with pre-trained vision-language model

S Zhao, R Quan, L Zhu, Y Yang - arXiv preprint arXiv:2305.14014, 2023 - arxiv.org
Pre-trained vision-language models (VLMs) are the de-facto foundation models for various
downstream tasks. However, scene text recognition methods still prefer backbones pre …

Implicit feature alignment: learn to convert text recognizer to text spotter

T Wang, Y Zhu, L Jin, D Peng, Z Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
Text recognition is a popular research subject with many associated challenges. Despite the
considerable progress made in recent years, the text recognition task itself is still …

On vocabulary reliance in scene text recognition

Z Wan, J Zhang, L Zhang, J Luo… - Proceedings of the …, 2020 - openaccess.thecvf.com
The pursuit of high performance on public benchmarks has been the driving force for
research in scene text recognition, and notable progress has been achieved. However, a …

Revisiting scene text recognition: A data perspective

Q Jiang, J Wang, D Peng, C Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper aims to re-assess scene text recognition (STR) from a data-oriented perspective.
We begin by revisiting the six commonly used benchmarks in STR and observe a trend of …

PMMN: pre-trained multi-modal network for scene text recognition

Y Zhang, Z Fu, F Huang, Y Liu - Pattern Recognition Letters, 2021 - Elsevier
Scene Text Recognition (STR) task needs to consume large-amount data to
develop a powerful recognizer, including visual data like images and linguistic data like …

Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H Xie, Y Wang, Z Mao… - Proceedings of the …, 2021 - openaccess.thecvf.com
Linguistic knowledge is of great benefit to scene text recognition. However, how to effectively
model linguistic rules in end-to-end deep networks remains a research challenge. In this …

MASTER: Multi-aspect non-local network for scene text recognition

N Lu, W Yu, X Qi, Y Chen, P Gong, R Xiao, X Bai - Pattern Recognition, 2021 - Elsevier
Attention-based scene text recognizers have gained huge success, which leverages a more
compact intermediate representation to learn 1d- or 2d-attention by an RNN-based encoder …

KISS: Keeping it simple for scene text recognition

C Bartz, J Bethge, H Yang, C Meinel - arXiv preprint arXiv:1911.08400, 2019 - arxiv.org
Over the past few years, several new methods for scene text recognition have been
proposed. Most of these methods propose novel building blocks for neural networks. These …

SCATTER: selective context attentional scene text recognizer

R Litman, O Anschel, S Tsiper… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene Text Recognition (STR), the task of recognizing text against complex image
backgrounds, is an active area of research. Current state-of-the-art (SOTA) methods still …