Vision-language pre-training for boosting scene text detectors

S Song, J Wan, Z Yang, J Tang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, vision-language joint representation learning has proven to be highly effective in
various scenarios. In this paper, we specifically adapt vision-language joint learning for …

Turning a clip model into a scene text detector

W Yu, Y Liu, W Hua, D Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …

MOST: A multi-oriented scene text detector with localization refinement

M He, M Liao, Z Yang, H Zhong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Over the past few years, the field of scene text detection has progressed rapidly that modern
text detectors are able to hunt text in various challenging scenarios. However, they might still …

Wetext: Scene text detection under weak supervision

S Tian, S Lu, C Li - … of the IEEE international conference on …, 2017 - openaccess.thecvf.com
The requiring of large amounts of annotated training data has become a common constraint
on various deep learning systems. In this paper, we propose a weakly supervised scene text …

Textboxes++: A single-shot oriented scene text detector

M Liao, B Shi, X Bai - IEEE transactions on image processing, 2018 - ieeexplore.ieee.org
Scene text detection is an important step of scene text recognition system and also a
challenging problem. Different from general object detections, the main challenges of scene …

Cleval: Character-level evaluation for text detection and recognition tasks

Y Baek, D Nam, S Park, J Lee, S Shin… - Proceedings of the …, 2020 - openaccess.thecvf.com
Despite the recent success of text detection and recognition methods, existing evaluation
metrics fail to provide a fair and reliable comparison among those methods. In addition …

LayoutFormer: Hierarchical Text Detection Towards Scene Text Understanding

M Liang, JW Ma, X Zhu, J Qin… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Existing scene text detectors generally focus on accurately detecting single-level (ie word-
level line-level or paragraph-level) text entities without exploring the relationships among …

Textboxes: A fast text detector with a single deep neural network

M Liao, B Shi, X Bai, X Wang, W Liu - Proceedings of the AAAI …, 2017 - ojs.aaai.org
This paper presents an end-to-end trainable fast scene text detector, named TextBoxes,
which detects scene text with both high accuracy and efficiency in a single network forward …

Character region awareness for text detection

Y Baek, B Lee, D Han, S Yun… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …

STELA: A real-time scene text detector with learned anchor

L Deng, Y Gong, X Lu, Y Lin, Z Ma, M Xie - IEEE Access, 2019 - ieeexplore.ieee.org
To achieve high coverage of target boxes, a normal strategy of conventional one-stage
anchor-based detectors is to utilize multiple priors at each spatial position, especially in …