scene text recognition vision language - Academic Resource Search

An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition

B Shi, X Bai, C Yao - IEEE transactions on pattern analysis and …, 2016 - ieeexplore.ieee.org

… in computer vision. In this paper, we investigate the problem of scene text recognition, which
is … Alahari, and CV Jawahar, “Scene text recognition using higher order language priors,” in …

Cited by 3093 · Related articles · All 11 versions

[PDF] thecvf.com

Scene text recognition using part-based tree-structured character detection

C Shi, C Wang, B Xiao, Y Zhang… - … pattern recognition, 2013 - openaccess.thecvf.com

… vision community in recent years. In this paper, we propose a novel scene text recognition
… We use character detection scores, spatial constraints and language model to define the …

Cited by 215 · Related articles · All 11 versions

Visual matching is enough for scene text retrieval

L Wen, Y Wang, D Zhang, G Chen - … on Web Search and Data Mining, 2023 - dl.acm.org

… text space using text recognition models or project the query … scene text retrieval on MLT, a
dataset covering 9 languages … The fully supervised mode utilizes training sets of all languages…

Cited by 6 · Related articles

Street view text recognition with deep learning for urban scene understanding in intelligent transportation systems

C Zhang, W Ding, G Peng, F Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

… language has a critical influence on scene text detection. Moreover, by comparing the accuracy
of four scene text recognition … street view text recognition to fit real-world ITS applications. …

Cited by 50 · Related articles · All 3 versions

[PDF] arxiv.org

Transfer learning for scene text recognition in Indian languages

S Gunna, R Saluja, CV Jawahar - … on Document Analysis and Recognition, 2021 - Springer

… of deep scene text recognition networks from English to two common Indian languages. We
… The underlying view is that the transfer of image features is standard in deep models, and …

Cited by 11 · Related articles · All 10 versions

[PDF] researchgate.net

Multimodal Visual-Semantic Representations Learning for Scene Text Recognition

X Gao, Y Pang, Y Liu, M Han, J Yu, W Wang… - ACM Transactions on …, 2024 - dl.acm.org

… Scene Text Recognition methods, which can be generally divided into visual clues driven text
recognizer and multimodal visual… iterative language modeling for scene text recognition. In …

Scene text recognition based on improved CRNN

W Yu, M Ibrayim, A Hamdulla - Information, 2023 - mdpi.com

… , and add a language model to increase … vision. With the continuous development of deep
learning fields such as computer vision, pattern recognition, and machine learning, scene text …

Cited by 4 · Related articles · All 3 versions

[PDF] arxiv.org

CMFN: Cross-modal fusion network for irregular scene text recognition

J Zheng, R Ji, L Zhang, Y Wu, C Zhao - International Conference on …, 2023 - Springer

… ) for irregular scene text recognition, which incorporates visual cues … , a visual recognition
branch and an iterative semantic … Our CMFN fuses visual cues in the language module when …

Cited by 3 · Related articles · All 4 versions

[PDF] umass.edu

Improving open-vocabulary scene text recognition

JL Feild, EG Learned-Miller - … analysis and recognition, 2013 - ieeexplore.ieee.org

… Abstract—This paper presents a system for open-vocabulary text recognition in images … text
in the environment into other languages and improving navigation for people with low vision. …

Cited by 33 · Related articles · All 9 versions

[PDF] arxiv.org

Lumos: Empowering Multimodal LLMs with Scene Text Recognition

A Shenoy, Y Lu, S Jayakumar, D Chatterjee… - arXiv preprint arXiv …, 2024 - arxiv.org

… Visual question answering has been a research area for … recent progress on LLMs and
vision language pre-training (Multi-… needed for scene understanding, visual understanding and …

Cited by 1 · Related articles · All 2 versions
