UPOCR: Towards Unified Pixel-Level OCR Interface

D Peng, Z Yang, J Zhang, C Liu, Y Shi… - … on Machine Learning, 2023 - openreview.net
Existing optical character recognition (OCR) methods rely on task-specific designs with
divergent paradigms, architectures, and training strategies, which significantly increases the …

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

H Yu, T Fu, B Li, X Xue - arXiv preprint arXiv:2407.17020, 2024 - arxiv.org
Scene text segmentation aims at cropping texts from scene images, which is usually used to
help generative models edit or remove texts. The existing text segmentation methods tend to …

Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

M Ye, J Zhang, J Liu, C Liu, B Yin, C Liu, B Du… - arXiv preprint arXiv …, 2024 - arxiv.org
The Segment Anything Model (SAM), a profound vision foundation model pre-trained on a
large-scale dataset, breaks the boundaries of general segmentation and sparks various …

WAS: Dataset and Methods for Artistic Text Segmentation

X Xie, Y Li, Y Liu, Z Zhang, Z Wang, W Xiong… - arXiv preprint arXiv …, 2024 - arxiv.org
Accurate text segmentation results are crucial for text-related generative tasks, such as text
image generation, text editing, text removal, and text style transfer. Recently, some scene …