An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition

B Shi, X Bai, C Yao - IEEE transactions on pattern analysis and …, 2016 - ieeexplore.ieee.org
… in computer vision. In this paper, we investigate the problem of scene text recognition, which
is … Alahari, and CV Jawahar, “Scene text recognition using higher order language priors,” in …

Scene text recognition using part-based tree-structured character detection

C Shi, C Wang, B Xiao, Y Zhang… - … pattern recognition, 2013 - openaccess.thecvf.com
vision community in recent years. In this paper, we propose a novel scene text recognition
… We use character detection scores, spatial constraints and language model to define the …

Visual matching is enough for scene text retrieval

L Wen, Y Wang, D Zhang, G Chen - … on Web Search and Data Mining, 2023 - dl.acm.org
text space using text recognition models or project the query … scene text retrieval on MLT, a
dataset covering 9 languages … The fully supervised mode utilizes training sets of all languages

Street view text recognition with deep learning for urban scene understanding in intelligent transportation systems

C Zhang, W Ding, G Peng, F Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
language has a critical influence on scene text detection. Moreover, by comparing the accuracy
of four scene text recognition … street view text recognition to fit real-world ITS applications. …

Transfer learning for scene text recognition in Indian languages

S Gunna, R Saluja, CV Jawahar - … on Document Analysis and Recognition, 2021 - Springer
… of deep scene text recognition networks from English to two common Indian languages. We
… The underlying view is that the transfer of image features is standard in deep models, and …

Multimodal Visual-Semantic Representations Learning for Scene Text Recognition

X Gao, Y Pang, Y Liu, M Han, J Yu, W Wang… - ACM Transactions on …, 2024 - dl.acm.org
Scene Text Recognition methods, which can be generally divided into visual clues driven text
recognizer and multimodal visual… iterative language modeling for scene text recognition. In …

Scene text recognition based on improved CRNN

W Yu, M Ibrayim, A Hamdulla - Information, 2023 - mdpi.com
… , and add a language model to increase … vision. With the continuous development of deep
learning fields such as computer vision, pattern recognition, and machine learning, scene text

Cmfn: Cross-modal fusion network for irregular scene text recognition

J Zheng, R Ji, L Zhang, Y Wu, C Zhao - International Conference on …, 2023 - Springer
… ) for irregular scene text recognition, which incorporates visual cues … , a visual recognition
branch and an iterative semantic … Our CMFN fuses visual cues in the language module when …

Improving open-vocabulary scene text recognition

JL Feild, EG Learned-Miller - … analysis and recognition, 2013 - ieeexplore.ieee.org
… Abstract—This paper presents a system for open-vocabulary text recognition in images … text
in the environment into other languages and improving navigation for people with low vision. …

Lumos: Empowering Multimodal LLMs with Scene Text Recognition

A Shenoy, Y Lu, S Jayakumar, D Chatterjee… - arXiv preprint arXiv …, 2024 - arxiv.org
Visual question answering has been a research area for … recent progress on LLMs and
vision language pre-training (Multi-… needed for scene understanding, visual understanding and …