Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Traditional to transfer learning progression on scene text detection and recognition: a survey

N Gupta, AS Jalal - Artificial Intelligence Review, 2022 - Springer
Many computer vision-based techniques utilize semantic information ie scene text present in
a natural scene for image analysis. Subsequently, in recent times researchers pay more …

Scene text detection and recognition with advances in deep learning: a survey

X Liu, G Meng, C Pan - International Journal on Document Analysis and …, 2019 - Springer
Scene text detection and recognition has become a very active research topic in recent
several years. It can find many applications in reality ranging from navigation for vision …

Beyond OCR+ VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa

G Zeng, Y Zhang, Y Zhou, X Yang, N Jiang, G Zhao… - Pattern Recognition, 2023 - Elsevier
Text-based visual question answering (TextVQA), which answers a visual question by
considering both visual contents and scene texts, has attracted increasing attention recently …

Unambiguous scene text segmentation with referring expression comprehension

X Rong, C Yi, Y Tian - IEEE Transactions on Image Processing, 2019 - ieeexplore.ieee.org
Text instance provides valuable information for the understanding and interpretation of
natural scenes. The rich precise high-level semantics embodied in the text could be …

Mining the displacement of max-pooling for text recognition

Y Zheng, BK Iwana, S Uchida - Pattern Recognition, 2019 - Elsevier
The max-pooling operation in convolutional neural networks (CNNs) downsamples the
feature maps of convolutional layers. However, in doing so, it loses some spatial information …

An end-to-end ocr text re-organization sequence learning for rich-text detail image comprehension

L Li, F Gao, J Bu, Y Wang, Z Yu, Q Zheng - Computer Vision–ECCV 2020 …, 2020 - Springer
Nowadays the description of detailed images helps users know more about the
commodities. With the help of OCR technology, the description text can be detected and …

Improving 3D metric GPR imaging using automated data collection and learning-based processing

J Feng, L Yang, E Hoxha, J Xiao - IEEE Sensors Journal, 2022 - ieeexplore.ieee.org
Ground Penetrating Radar (GPR) is one of the most important non-destructive evaluation
(NDE) devices to detect subsurface objects (ie, rebars, utility pipes) and reconstruct the …

Urdu signboard detection and recognition using deep learning

SY Arafat, N Ashraf, MJ Iqbal, I Ahmad, S Khan… - Multimedia Tools and …, 2022 - Springer
Signboard detection and recognition is an important task in automated context-aware
marketing. Recently many scripting languages like Latin, Japanese, and Chinese have been …

Scene-text oriented referring expression comprehension

Y Bu, L Li, J Xie, Q Liu, Y Cai… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Referring expression comprehension (REC) aims to identify and locate a specific object in
visual scenes referred to by a natural language expression. Existing studies of REC only …