Detrs with hybrid matching

D Jia, Y Yuan, H He, X Wu, H Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com
One-to-one set matching is a key design for DETR to establish its end-to-end capability, so
that object detection does not require a hand-crafted NMS (non-maximum suppression) to …

Towards end-to-end unified scene text detection and layout analysis

S Long, S Qin, D Panteleev… - Proceedings of the …, 2022 - openaccess.thecvf.com
Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …

Few could be better than all: Feature sampling and grouping for scene text detection

J Tang, W Zhang, H Liu, MK Yang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, transformer-based methods have achieved promising progresses in object
detection, as they can eliminate the post-processes like NMS and enrich the deep …

A novel degraded document binarization model through vision transformer network

M Yang, S Xu - Information Fusion, 2023 - Elsevier
Degraded document binarization has received keen attention due to its vital influence on
subsequent document analysis tasks. In this study, we propose a novel Degraded Document …

Turning a clip model into a scene text spotter

W Yu, Y Liu, X Zhu, H Cao, X Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
We exploit the potential of the large-scale Contrastive Language-Image Pretraining (CLIP)
model to enhance scene text detection and spotting tasks, transforming it into a robust …

HGR-Net: Hierarchical graph reasoning network for arbitrary shape scene text detection

H Bi, C Xu, C Shi, G Liu, H Zhang, Y Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
As a prerequisite step of scene text reading, scene text detection is known as a challenging
task due to natural scene text diversity and variability. Most existing methods either adopt …

Scene text understanding: recapitulating the past decade

M Ghosh, H Mukherjee, SM Obaidullah, XZ Gao… - Artificial Intelligence …, 2023 - Springer
Computational perception has indeed been dramatically modified and reformed from
handcrafted feature-based techniques to the advent of deep learning. Scene text …

Explore faster localization learning for scene text detection

Y Zhao, Y Cai, W Wu, W Wang - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Generally, pre-training and long-time training computation are necessary for obtaining a
good-performance text detector based on deep networks. In this paper, we present a new …

Arbitrary shape text detection using transformers

Z Raisi, G Younes, J Zelek - 2022 26th International …, 2022 - ieeexplore.ieee.org
Recent text detection frameworks require several handcrafted components such as anchor
generation, non-maximum suppression (NMS), or multiple processing stages (eg label …

An end-to-end model for multi-view scene text recognition

A Banerjee, P Shivakumara, S Bhattacharya, U Pal… - Pattern Recognition, 2024 - Elsevier
Due to the increasing applications of surveillance and monitoring such as person re-
identification, vehicle re-identification and sports events tracking, the necessity of text …