Deep learning approaches to scene text detection: a comprehensive review

T Khan, R Sarkar, AF Mollah - Artificial Intelligence Review, 2021 - Springer
In recent times, text detection in the wild has significantly raised its ability due to tremendous
success of deep learning models. Applications of computer vision have emerged and got …

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

MMRotate: A rotated object detection benchmark using PyTorch

Y Zhou, X Yang, G Zhang, J Wang, Y Liu… - Proceedings of the 30th …, 2022 - dl.acm.org
We present an open-source toolbox, named MMRotate, which provides a coherent algorithm
framework of training, inferring, and evaluation for the popular rotated object detection …

TrOCR: Transformer-based optical character recognition with pre-trained models

M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio… - Proceedings of the …, 2023 - ojs.aaai.org
Text recognition is a long-standing research problem for document digitalization. Existing
approaches are usually built based on CNN for image understanding and RNN for char …

Text detection, recognition, and script identification in natural scene images: a review

V Naosekpam, N Sahu - International Journal of Multimedia Information …, 2022 - Springer
Text in natural scene images plays a vital role in scene understanding. It contains a rich and
abundant amount of valuable semantic information useful in many applications such as …

Dynamic anchor learning for arbitrary-oriented object detection

Q Ming, Z Zhou, L Miao, H Zhang, L Li - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Arbitrary-oriented objects widely appear in natural scenes, aerial photographs, remote
sensing images, etc., and thus arbitrary-oriented object detection has received considerable …

Fourier contour embedding for arbitrary-shaped text detection

Y Zhu, J Chen, L Liang, Z Kuang… - Proceedings of the …, 2021 - openaccess.thecvf.com
One of the main challenges for arbitrary-shaped text detection is to design a good text
instance representation that allows networks to learn diverse text geometry variances. Most …

Turning a CLIP model into a scene text detector

W Yu, Y Liu, W Hua, D Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …

GeoLayoutLM: Geometric pre-training for visual information extraction

C Luo, C Cheng, Q Zheng… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Visual information extraction (VIE) plays an important role in Document Intelligence.
Generally, it is divided into two tasks: semantic entity recognition (SER) and relation …

Mask TextSpotter v3: Segmentation proposal network for robust scene text spotting

M Liao, G Pang, J Huang, T Hassner, X Bai - Computer Vision–ECCV …, 2020 - Springer
Recent end-to-end trainable methods for scene text spotting, integrating detection and
recognition, showed much progress. However, most of the current arbitrary-shape scene text …