… poral Classification have also been widely used in scenetextrecognition, in works such as [47, … Question Answering (VQA) aims to come up with an answer to a given natural language …
X Bai, C Yao, W Liu - … on computer vision and pattern recognition, 2014 - cv-foundation.org
… textrecognition in natural scenes (aka scenetextrecognition) … characteristics of the corresponding languages. For example, … could learn a hybrid set of strokelets on multiple …
J Xu, Y Wang, H Xie, Y Zhang - … and Pattern Recognition, 2024 - openaccess.thecvf.com
… a sequence of visual tokens to represent scenetext images, … the entire text image and achieve accurate textrecognition. … and iterative language modeling for scenetextrecognition. …
… for SceneTextRecognition (STR) task. Considering scenetext images carry both visual and … from translation by recognizing foreign languages to street sign recognition for autonomous. …
M Huang, J Zhang, D Peng, H Lu… - … Computer Vision, 2023 - openaccess.thecvf.com
… a vision-language communication module designed to enhance explicit synergy, which utilizes a novel collaborative cross-modal interaction between textdetection and recognition. …
D Yu, X Li, C Zhang, T Liu, J Han… - … pattern recognition, 2020 - openaccess.thecvf.com
… In addition, transformer has been proved to be effective in many tasks of computer vision [11, 36] and natural language processing [34]. In this paper, we not only adopt transformer to …
H Wang, J Liao, T Cheng, Z Gao… - … Recognition, 2022 - openaccess.thecvf.com
… ] as a knowledge-aware language model and apply them to extract … vision-language tasks, they require the annotation of image-… Scenetext retrieval via joint textdetection and similarity …
… demonstrate that VST can achieve higher or competitive prediction accuracy in scene textrecognition without the aid of explicit language models. 2 Visual-Semantic Transformer …
… of different algorithms in learning language prior. Meanwhile, … for developing scenetext recognition algorithms in the future. … Segmentation-based methods can accurately extract visual …