UPOCR: Towards Unified Pixel-Level OCR Interface

Citing articles (4 results)

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

J Zhang, D Peng, C Liu, P Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Document image restoration is a crucial aspect of Document AI systems as the quality of
document images significantly influences the overall performance. Prevailing methods …

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Y Shu, W Zeng, Z Li, F Zhao, Y Zhou - arXiv preprint arXiv:2402.03082, 2024 - arxiv.org

Visual text, a pivotal element in both document and scene images, speaks volumes and
attracts significant attention in the computer vision domain. Beyond visual text detection and …

Generalized Tampered Scene Text Detection in the era of Generative AI

C Qu, Y Zhong, F Guo, L Jin - arXiv preprint arXiv:2407.21422, 2024 - arxiv.org

The rapid advancements of generative AI have fueled the potential of generative text image
editing while simultaneously escalating the threat of misinformation spreading. However …

Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

M Ye, J Zhang, J Liu, C Liu, B Yin, C Liu, B Du… - arXiv preprint arXiv …, 2024 - arxiv.org

The Segment Anything Model (SAM), a profound vision foundation model pre-trained on a
large-scale dataset, breaks the boundaries of general segmentation and sparks various …
