C Shi, C Wang, B Xiao, Y Zhang… - … pattern recognition, 2013 - openaccess.thecvf.com
… vision community in recent years. In this paper, we propose a novel scene text recognition … We use character detection scores, spatial constraints and language model to define the …
L Wen, Y Wang, D Zhang, G Chen - … on Web Search and Data Mining, 2023 - dl.acm.org
… text space using text recognition models or project the query … scene text retrieval on MLT, a dataset covering 9 languages … The fully supervised mode utilizes training sets of all languages…
C Zhang, W Ding, G Peng, F Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
… language has a critical influence on scene text detection. Moreover, by comparing the accuracy of four scene text recognition … street view text recognition to fit real-world ITS applications. …
S Gunna, R Saluja, CV Jawahar - … on Document Analysis and Recognition, 2021 - Springer
… of deep scene text recognition networks from English to two common Indian languages. We … The underlying view is that the transfer of image features is standard in deep models, and …
X Gao, Y Pang, Y Liu, M Han, J Yu, W Wang… - ACM Transactions on …, 2024 - dl.acm.org
… Scene Text Recognition methods, which can be generally divided into visual clues driven text recognizer and multimodal visual… iterative language modeling for scene text recognition. In …
W Yu, M Ibrayim, A Hamdulla - Information, 2023 - mdpi.com
… , and add a language model to increase … vision. With the continuous development of deep learning fields such as computer vision, pattern recognition, and machine learning, scene text …
… ) for irregular scene text recognition, which incorporates visual cues … , a visual recognition branch and an iterative semantic … Our CMFN fuses visual cues in the language module when …
… Abstract—This paper presents a system for open-vocabulary text recognition in images … text in the environment into other languages and improving navigation for people with low vision. …
… Visual question answering has been a research area for … recent progress on LLMs and vision language pre-training (Multi-… needed for scene understanding, visual understanding and …