Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Traditional to transfer learning progression on scene text detection and recognition: a survey

N Gupta, AS Jalal - Artificial Intelligence Review, 2022 - Springer
Many computer vision-based techniques utilize semantic information ie scene text present in
a natural scene for image analysis. Subsequently, in recent times researchers pay more …

Mask textspotter: An end-to-end trainable neural network for spotting text with arbitrary shapes

P Lyu, M Liao, C Yao, W Wu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Recently, models based on deep neural networks have dominated the fields of scene text
detection and recognition. In this paper, we investigate the problem of scene text spotting …

Arbitrary-oriented scene text detection via rotation proposals

J Ma, W Shao, H Ye, L Wang, H Wang… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
This paper introduces a novel rotation-based framework for arbitrary-oriented text detection
in natural scene images. We present the Rotation Region Proposal Networks, which are …

Multi-oriented scene text detection via corner localization and region segmentation

P Lyu, C Yao, W Wu, S Yan… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Previous deep learning based state-of-the-art scene text detection methods can be roughly
classified into two categories. The first category treats scene text as a type of general objects …

Vista: Vision and scene text aggregation for cross-modal retrieval

M Cheng, Y Sun, L Wang, X Zhu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Visual appearance is considered to be the most important cue to understand images for
cross-modal retrieval, while sometimes the scene text appearing in images can provide …

All you need is boundary: Toward arbitrary-shaped text spotting

H Wang, P Lu, H Zhang, M Yang, X Bai, Y Xu… - Proceedings of the …, 2020 - ojs.aaai.org
Recently, end-to-end text spotting that aims to detect and recognize text from cluttered
images simultaneously has received particularly growing interest in computer vision …

A unified matrix-based convolutional neural network for fine-grained image classification of wheat leaf diseases

Z Lin, S Mu, F Huang, KA Mateen, M Wang… - IEEE …, 2019 - ieeexplore.ieee.org
Fine-grained image classification methods often suffer from the challenge that the
subordinate categories within an entry-level category can only be distinguished by subtle …

A survey of methods, datasets and evaluation metrics for visual question answering

H Sharma, AS Jalal - Image and Vision Computing, 2021 - Elsevier
Abstract Visual Question Answering (VQA) is a multi-disciplinary research problem that has
captured the attention of both computer vision as well as natural language processing …

Street view text recognition with deep learning for urban scene understanding in intelligent transportation systems

C Zhang, W Ding, G Peng, F Fu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Understanding the surrounding scenes is one of the fundamental tasks in intelligent
transportation systems (ITS), especially in unpredictable driving scenes or in developing …