DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

J Zhang, D Peng, C Liu, P Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Document image restoration is a crucial aspect of Document AI systems as the quality of
document images significantly influences the overall performance. Prevailing methods …

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Y Shu, W Zeng, Z Li, F Zhao, Y Zhou - arXiv preprint arXiv:2402.03082, 2024 - arxiv.org
Visual text, a pivotal element in both document and scene images, speaks volumes and
attracts significant attention in the computer vision domain. Beyond visual text detection and …

Generalized Tampered Scene Text Detection in the era of Generative AI

C Qu, Y Zhong, F Guo, L Jin - arXiv preprint arXiv:2407.21422, 2024 - arxiv.org
The rapid advancements of generative AI have fueled the potential of generative text image
editing while simultaneously escalating the threat of misinformation spreading. However …

Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

M Ye, J Zhang, J Liu, C Liu, B Yin, C Liu, B Du… - arXiv preprint arXiv …, 2024 - arxiv.org
The Segment Anything Model (SAM), a profound vision foundation model pre-trained on a
large-scale dataset, breaks the boundaries of general segmentation and sparks various …