PMMN: pre-trained multi-modal network for scene text recognition

Y Zhang, Z Fu, F Huang, Y Liu - Pattern Recognition Letters, 2021 - Elsevier
Abstract Scene Text Recognition (STR) task needs to consume large-amount data to
develop a powerful recognizer, including visual data like images and linguistic data like …

RMFPN: end-to-end scene text recognition using multi-feature Pyramid Network

R Mahadshetti, GS Lee, DJ Choi - IEEE Access, 2023 - ieeexplore.ieee.org
Scene text recognition (STR) plays an important role in various computer vision activities.
STR has been a desirable research topic in the computer community, and deep learning …

Towards accurate scene text recognition with semantic reasoning networks

D Yu, X Li, C Zhang, T Liu, J Han… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text image contains two levels of contents: visual texture and semantic information.
Although the previous scene text recognition methods have made great progress over the …

SaHAN: Scale-aware hierarchical attention network for scene text recognition

J Zhang, C Luo, L Jin, T Wang, Z Li, W Zhou - Pattern Recognition Letters, 2020 - Elsevier
Scene text recognition has become a research hotspot owing to its abundant semantic
information and various applications. Recent methods of scene text recognition usually …

SLOAN: Scale-adaptive orientation attention network for scene text recognition

P Dai, H Zhang, X Cao - IEEE Transactions on Image …, 2020 - ieeexplore.ieee.org
Scene text recognition, the final step of the scene text reading system, has made impressive
progress based on deep neural networks. However, existing recognition methods devote to …

Scene text recognition via dual-path network with shape-driven attention alignment

Y Hu, B Dong, K Huang, L Ding, W Wang… - ACM Transactions on …, 2024 - dl.acm.org
Scene text recognition (STR), one typical sequence-to-sequence problem, has drawn much
attention recently in multimedia applications. To guarantee good performance, it is essential …

Multimodal Visual-Semantic Representations Learning for Scene Text Recognition

X Gao, Y Pang, Y Liu, M Han, J Yu, W Wang… - ACM Transactions on …, 2024 - dl.acm.org
Scene Text Recognition (STR), the critical step in OCR systems, has attracted much
attention in computer vision. Recent research on modeling textual semantics with Language …

Moran: A multi-object rectified attention network for scene text recognition

C Luo, L Jin, Z Sun - Pattern Recognition, 2019 - Elsevier
Irregular text is widely used. However, it is considerably difficult to recognize because of its
various shapes and distorted patterns. In this paper, we thus propose a multi-object rectified …

A two-level rectification attention network for scene text recognition

L Wu, Y Xu, J Hou, CLP Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Scene text recognition is a challenging task in the computer vision field due to the diversity
of text styles and the complexity of the image backgrounds. In recent decades, numerous …

Pimnet: a parallel, iterative and mimicking network for scene text recognition

Z Qiao, Y Zhou, J Wei, W Wang, Y Zhang… - Proceedings of the 29th …, 2021 - dl.acm.org
Nowadays, scene text recognition has attracted more and more attention due to its various
applications. Most state-of-the-art methods adopt an encoder-decoder framework with …