Multi-granularity prediction for scene text recognition

P Wang, C Da, C Yao - European Conference on Computer Vision, 2022 - Springer
Scene text recognition (STR) has been an active research topic in computer vision for years.
To tackle this challenging problem, numerous innovative methods have been successively …

Abinet++: Autonomous, bidirectional and iterative language modeling for scene text spotting

S Fang, Z Mao, H Xie, Y Wang, C Yan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …

Reading and writing: Discriminative and generative modeling for self-supervised text recognition

M Yang, M Liao, P Lu, J Wang, S Zhu, H Luo… - Proceedings of the 30th …, 2022 - dl.acm.org
Existing text recognition methods usually need large-scale training data. Most of them rely
on synthetic training data due to the lack of annotated real images. However, there is a …

Multi-modal text recognition networks: Interactive enhancements between visual and semantic features

B Na, Y Kim, S Park - European Conference on Computer Vision, 2022 - Springer
Linguistic knowledge has brought great benefits to scene text recognition by providing
semantics to refine character sequences. However, since linguistic knowledge has been …

Levenshtein ocr

C Da, P Wang, C Yao - European Conference on Computer Vision, 2022 - Springer
A novel scene text recognizer based on Vision-Language Transformer (VLT) is presented.
Inspired by Levenshtein Transformer in the area of NLP, the proposed method (named …

LISTER: Neighbor decoding for length-insensitive scene text recognition

C Cheng, P Wang, C Da, Q Zheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
The diversity in length constitutes a significant characteristic of text. Due to the long-tail
distribution of text lengths, most existing methods for scene text recognition (STR) only work …

Multi-view correlation distillation for incremental object detection

D Yang, Y Zhou, A Zhang, X Sun, D Wu, W Wang… - Pattern Recognition, 2022 - Elsevier
In real applications, new object classes often emerge after the detection model has been
trained on a prepared dataset with fixed classes. Fine-tuning the old model with only new …

Maskocr: Text recognition with masked encoder-decoder pretraining

P Lyu, C Zhang, S Liu, M Qiao, Y Xu, L Wu… - arXiv preprint arXiv …, 2022 - arxiv.org
Text images contain both visual and linguistic information. However, existing pre-training
techniques for text recognition mainly focus on either visual representation learning or …

Towards robust real-time scene text detection: From semantic to instance representation learning

X Qin, P Lyu, C Zhang, Y Zhou, K Yao… - Proceedings of the 31st …, 2023 - dl.acm.org
Due to the flexible representation of arbitrary-shaped scene text and simple pipeline, bottom-
up segmentation-based methods begin to be mainstream in real-time scene text detection …

Pure transformer with integrated experts for scene text recognition

YL Tan, AWK Kong, JJ Kim - European Conference on Computer Vision, 2022 - Springer
Scene text recognition (STR) involves the task of reading text in cropped images of natural
scenes. Conventional models in STR employ convolutional neural network (CNN) followed …