A Aberdam, D Bensaïd, A Golts… - … Computer Vision, 2023 - openaccess.thecvf.com
… In particular, we explore a range of vision and vision-language image encoders, pooling operators, light-to-heavy fusion schemes, and different integration points between word-level …
Y Wang, H Xie, S Fang, J Wang… - … on Computer Vision, 2021 - openaccess.thecvf.com
… vision model with language capability. Specially, we introduce the textrecognition of character… Such operation guides the vision model to use not only the visual texture of characters, but …
S Song, J Wan, Z Yang, J Tang… - … Recognition, 2022 - openaccess.thecvf.com
… Recently, vision-language joint representation learning has … adapt vision-language joint learning for scenetextdetection, a task … ities: vision and language, since text is the written form of …
… like characterdetection and recognition we provide annotated character bounding boxes. … We address a more general problem of scenetextrecognition, ie recognizing a word without …
R Atienza - … conference on document analysis and recognition, 2021 - Springer
… Scenetextrecognition (STR) enables computers to read text in natural scenes such as object labels, road signs and instructions. STR helps machines perform informed decisions such …
SK Ghosh, E Valveny… - … analysis and recognition …, 2017 - ieeexplore.ieee.org
… language modeling outperforms the state-ofthe-art in unconstrained scenetextrecognition … In this paper we proposed an LSTM-based visual attention model for scenetextrecognition. …
S Fang, H Xie, ZJ Zha, N Sun, J Tan… - Proceedings of the 26th …, 2018 - dl.acm.org
… loss from language aspect, multiple losses from attention and language are accumulated for … on standard datasets for scenetextrecognition, including Street ViewText, IIIT5K and …
P Wang, C Da, C Yao - European Conference on Computer Vision, 2022 - Springer
… language information of text. In order to effectively resort to linguistic information for scenetext recognition… in NLP [7] into textrecognition method. Subword tokenization algorithms aim to …
Y Zhang, Z Fu, F Huang, Y Liu - Pattern Recognition Letters, 2021 - Elsevier
… model and language model respectively to learn modality-specific knowledge for … scene textrecognition. In detail, we first pre-train the proposed off-the-shelf vision model and language …