Language matters: A weakly supervised vision-language pre-training approach for scene text detection and spotting

C Xue, W Zhang, Y Hao, S Lu, PHS Torr… - European Conference on …, 2022 - Springer
Recently, Vision-Language Pre-training (VLP) techniques have greatly benefited
various vision-language tasks by jointly learning visual and textual representations, which …

ABCNet: Real-time scene text spotting with adaptive Bezier-curve network

Y Liu, H Chen, C Shen, T He, L Jin… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text detection and recognition has received increasing research attention. Existing
methods can be roughly categorized into two groups: character-based and segmentation …

Turning a CLIP model into a scene text detector

W Yu, Y Liu, W Hua, D Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …

Geometry normalization networks for accurate scene text detection

Y Xu, J Duan, Z Kuang, X Yue, H Sun… - Proceedings of the …, 2019 - openaccess.thecvf.com
Large geometry (e.g., orientation) variances are the key challenges in the scene text
detection. In this work, we first conduct experiments to investigate the capacity of networks …

DPText-DETR: Towards better scene text detection with dynamic points in transformer

M Ye, J Zhang, S Zhao, J Liu, B Du, D Tao - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Recently, Transformer-based methods, which predict polygon points or Bezier curve control
points for localizing texts, are popular in scene text detection. However, these methods built …

Multiple attention encoded cascade R-CNN for scene text detection

Y Wu, W Liu, S Wan - Journal of Visual Communication and Image …, 2021 - Elsevier
Inspired by instance segmentation algorithms, researchers have proposed quantity of
segmentation-based methods for text detection, achieving remarkable results on scene text …

I3CL: Intra- and inter-instance collaborative learning for arbitrary-shaped scene text detection

B Du, J Ye, J Zhang, J Liu, D Tao - International Journal of Computer …, 2022 - Springer
Existing methods for arbitrary-shaped text detection in natural scenes face two critical
issues, i.e., (1) fracture detections at the gaps in a text instance; and (2) inaccurate detections …

Mask TextSpotter v3: Segmentation proposal network for robust scene text spotting

M Liao, G Pang, J Huang, T Hassner, X Bai - Computer Vision–ECCV …, 2020 - Springer
Recent end-to-end trainable methods for scene text spotting, integrating detection and
recognition, showed much progress. However, most of the current arbitrary-shape scene text …

ESTextSpotter: Towards better scene text spotting with explicit synergy in transformer

M Huang, J Zhang, D Peng, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …

R-Net: A relationship network for efficient and accurate scene text detection

Y Wang, H Xie, Z Zha, Y Tian, Z Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper introduces a novel bi-directional convolutional framework to cope with the
large-variance scale problem in scene text detection. Due to the lack of scale normalization in …